Applicant(s): Donoho et al. Group Art Unit: 1653
RESPONSE TO OFFICE ACTION DATED MAY 21, 2003

Mail Stop AF

Commissioner for Patents

Alexandria, VA 22313

Sir:

The Applicants acknowledge the receipt of the Office Action ("the Action") mailed on 
May 21, 2003 (Paper No. 12), which has been carefully reviewed and studied. Reexamination and
reconsideration of the application is requested in view of the following remarks. In order to facilitate
the Examiner's evaluation of the application, Applicants have attempted to address the rejections in
Paper No. 12 in the same order in which they were originally raised. 

The response is timely filed, and Applicants believe no fees are due in connection with this 
response. However, the Commissioner is authorized to charge any required fees or credit any 
overpayment to Deposit Account No. 50-0892. 



RESPONSE 



I. Status of the Claims 

No claims have been cancelled. No claims have been amended. No new claims have been
added.

Claims 1, 3, and 5-7 are therefore presently pending in the case. For the convenience of the
Examiner, a clean copy of the pending claims is attached hereto as Exhibit A. 
II. Rejection of Claims 1, 3 and 5-7 Under 35 U.S.C. § 101

The Action first rejects claims 1, 3 and 5-7 under 35 U.S.C. § 101, as allegedly lacking a
patentable utility. Applicants respectfully traverse. 

Applicants pointed out in the response filed on February 13, 2003 ("the previous response")
to the First Office Action in this case, which was issued on August 27, 2002 ("the First Action"), that a
sequence sharing nearly 100 percent identity at the protein level over an extended region of the
claimed sequence is present in the leading scientific repository for biological sequence data (GenBank),
and has been annotated by third party scientists who are wholly unaffiliated with Applicants as
"ADAMTSL3" (GenBank accession number AF237652; alignment and GenBank report shown in
Exhibit B). The Examiner questioned this assertion of utility because "'nearly 100% identity over and
extend region' (sic) does not necessarily provide adequate support" for this assertion (Action at page 4,
emphasis in original). The Examiner further stated that "Vernet et al (US 20030059768) teach a
protein that shares 99.9% identity with the sequence of SEQ ID NO:2 over the entire sequence" and
that this "protein is described as (sic) Vernet et al. as a STE20-like trypsin inhibitor" (Action bridging
pages 4 and 5, emphasis in original). There are a number of flaws in the Examiner's argument. First,
while it is true that the alignment between SEQ ID NO:2 and the sequence disclosed in GenBank
accession number AF237652 only covers 766 of the 1691 amino acids of SEQ ID NO:2 (with 765,
or 99.9%, identity), the AF237652 sequence is clearly annotated in GenBank as a "partial cds" (see
Exhibit B) and only contains 766 amino acids. However, examination of the manuscript referenced
in the GenBank report for AF237652, Hirohata et al. (which, although still indicated as "unpublished",
in fact was published in J. Biol. Chem. 277:12182-12189, 2002; "Hirohata"; Exhibit C), clearly
indicates that the full length ADAMTSL3 sequence is "1690 amino acids" in length (Hirohata at page
12187, second column). Thus, although the information in GenBank accession number AF237652 has
not been updated to reflect the full length ADAMTSL3 sequence (or information about the Hirohata
publication), the skilled artisan would readily recognize that Applicants' claimed sequence is, in fact,
a full length ADAMTSL3 sequence.
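
The alignment figures recited above can be checked with simple arithmetic. The following is a minimal illustrative sketch only, not part of the record: the function name `percent_of` is hypothetical, and the only inputs are the counts stated in the paragraph above (765 identical residues over a 766-residue alignment, against the 1691 amino acids of SEQ ID NO:2).

```python
def percent_of(part: int, whole: int) -> float:
    """Return part/whole expressed as a percentage."""
    if whole <= 0:
        raise ValueError("whole must be a positive count")
    return 100.0 * part / whole


# Counts taken from the paragraph above.
IDENTICAL_RESIDUES = 765   # matching amino acids in the alignment
ALIGNED_RESIDUES = 766     # length of the AF237652 partial cds alignment
SEQ_ID_NO_2_LENGTH = 1691  # amino acids in SEQ ID NO:2

# Identity within the aligned region.
identity = percent_of(IDENTICAL_RESIDUES, ALIGNED_RESIDUES)
# Fraction of SEQ ID NO:2 that the partial sequence covers.
coverage = percent_of(ALIGNED_RESIDUES, SEQ_ID_NO_2_LENGTH)

print(f"identity: {identity:.2f}% over {coverage:.1f}% of SEQ ID NO:2")
```

Under these inputs the identity is high within the aligned region while the alignment spans under half of SEQ ID NO:2, which is consistent with AF237652 being a partial coding sequence.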

Second, Applicants are completely at a loss to understand why the Examiner seems to be
accepting the assertion of utility set forth in U.S. Patent Application 20030059768 at face value ("(t)he
protein is described as (sic) Vernet et al. as a STE20-like trypsin inhibitor" (Action bridging pages 4
and 5)), while not accepting Applicants' assertion that the presently claimed sequence encodes an
ADAMTS-like protein, even though third party scientists who are wholly unaffiliated with Applicants
have confirmed Applicants' assertion, as described above. Applicants respectfully remind the
Examiner that the legal test for utility simply involves an assessment of whether those skilled in the art
would find any of the utilities described for the invention to be credible or believable. Given this
GenBank annotation and the Hirohata manuscript, there can be no question that those skilled in the art
would clearly believe that Applicants' sequence is an ADAMTSL3 sequence. The specification as
originally filed details that the present sequence is associated with the extracellular matrix (at least at
page 1, line 27), which is confirmed by Hirohata (see Exhibit C). Given the well established biological
relevance of ADAMTS-like proteins, those of skill in the art would readily appreciate the utility of the
present sequence in numerous applications, as described below. Thus, the Examiner's argument fails
to support the assertion that the present claims lack a patentable utility.

As set forth in the previous response, as just one example of the utility of the present nucleotide 
sequences, the specification details on page 5, lines 2-4, that the present nucleotide sequences have 
utility in assessing gene expression patterns using high-throughput DNA chips. Such "DNA chips" 
clearly have utility, as evidenced by hundreds of issued U.S. Patents, as exemplified by U.S. Patent 
Nos. 5,445,934, 5,556,752, 5,744,305, 5,837,832, 6,156,501 and 6,261,776. As the present 
sequences are specific markers of the human genome (see below), and such specific markers are 
targets for the discovery of drugs that are associated with human disease, those of skill in the art would 
instantly recognize that the present nucleotide sequences would be an ideal, novel candidate for 
assessing gene expression using such DNA chips. Given the widespread utility of such "gene chip" 
methods using public domain gene sequence information, there can be little doubt that the use of the 
presently described novel sequences would have great utility in such DNA chip applications. Clearly, 
compositions that enhance the utility of such DNA chips, such as the presently claimed nucleotide 
sequences, must in themselves be useful. 

Evidence of the "real world" substantial utility of the present invention is further provided by the 
fact that there is an entire industry established based on the use of gene sequences or fragments thereof 
in a gene chip format. Perhaps the most notable gene chip company is Affymetrix. However, there are
many companies which have, at one time or another, concentrated on the use of gene sequences or
fragments, in gene chip and non-gene chip formats, for example: Gene Logic, ABI-Perkin-Elmer,
HySeq and Incyte. In addition, one such company (Rosetta Inpharmatics) was viewed to have such
"real world" value that it was acquired by a large pharmaceutical company (Merck) for significant sums
of money (net equity value of the transaction was $620 million). The "real world" substantial industrial
utility of gene sequences or fragments would, therefore, appear to be widespread and well established.
Clearly, persons of skill in the art, as well as venture capitalists and investors, readily recognize the
utility, both scientific and commercial, of genomic data in general, and specifically human genomic data.
Billions of dollars have been invested in the human genome project, resulting in useful genomic data
(see, e.g., Venter et al., 2001, Science 291:1304). The results have been a stunning success as the
utility of human genomic data has been widely recognized as a great gift to humanity (see, e.g., Jasny
and Kennedy, 2001, Science 291:1153). Clearly, the usefulness of human genomic data, such as the
presently claimed nucleic acid molecules, is substantial and credible (worthy of billions of dollars and
the creation of numerous companies focused on such information) and well-established (the utility of
human genomic information has been clearly understood for many years).

The Examiner questions this asserted utility, stating that such a use does "not constitute a
specific or substantial utility" (Action at page 4). The Examiner seems to be confusing the requirements
of a specific utility with a unique utility. As clearly set forth by the Federal Circuit in Carl Zeiss
Stiftung v. Renishaw PLC, 20 USPQ2d 1101 (Fed. Cir. 1991):

An invention need not be the best or only way to accomplish a certain result, and it
need only be useful to some extent and in certain applications: "[T]he fact that an
invention has only limited utility and is only operable in certain applications is not
grounds for finding a lack of utility." Envirotech Corp. v. Al George, Inc., 221 USPQ
473, 480 (Fed. Cir. 1984)

The fact that other nucleotide sequences can be used to track gene expression does not mean that the 
use of Applicants' sequence to track gene expression is not a specific utility. If every invention were 
required to have a unique utility, the Patent and Trademark Office would no longer be issuing patents 
on batteries, automobile tires, golf balls, golf clubs, and treatments for a variety of human diseases, such 
as cancer, just to name a few particular examples, because the utility of each of these compositions is
applicable to the broad class in which each of these compositions falls: all batteries have the same utility, 
specifically to provide electrical power; all automobile tires have the same utility, specifically for use on
automobiles; all golf balls and golf clubs have the same utility, specifically for use in the game of golf; 
and all cancer treatments have the same utility, specifically, to treat cancer. However, only the briefest 
perusal of virtually any issue of the Official Gazette provides numerous examples of patents being 
granted on each of the above compositions nearly every week. Furthermore, if a composition needed
to be unique to be patented, the entire class and subclass system would be an effort in futility, as the
class and subclass system serves solely to group such common inventions, which would not be required 
if each invention needed to have a unique utility. Thus, the present sequence clearly meets the 
requirements of 35 U.S.C. § 101.

Although Applicants need only make one credible assertion of utility to meet the requirements 
of 35 U.S.C. § 101 (Raytheon v. Roper, 220 USPQ 592 (Fed. Cir. 1983); In re Gottlieb, 140
USPQ 665 (CCPA 1964); In re Malachowski, 189 USPQ 432 (CCPA 1976); Hoffman v. Klaus,
9 USPQ2d 1657 (Bd. Pat. App. & Inter. 1988)), Applicants pointed out in the previous response that 
as a further example of the utility of the presently claimed polynucleotide, as described in the 
specification at least at page 9, line 37, the present nucleotide sequence has a specific utility in
determining the genomic structure of the corresponding human chromosome, for example mapping the 
protein encoding regions. This is evidenced by the fact that SEQ ID NO: 1 can be used to map the 29 
coding exons on chromosome 15 (present within four overlapping chromosome 15 clones; GenBank
Accession Numbers AC027807, AC022684, AC116157 and AC087738; alignments and first page
from the GenBank reports provided in Exhibit D). Clearly, the present polynucleotide provides 
exquisite specificity in localizing the specific region of human chromosome 15 that contains the gene
encoding the given polynucleotide, a utility not shared by virtually any other nucleic acid sequences. 
In fact, it is this specificity that makes this particular sequence so useful. Early gene mapping techniques 
relied on methods such as Giemsa staining to identify regions of chromosomes. However, such 
techniques produced genetic maps with a resolution of only 5 to 10 megabases, far too low to be of 
much help in identifying specific genes involved in disease. The skilled artisan readily appreciates the 
significant benefit afforded by markers that map a specific locus of the human genome, such as the 
present nucleic acid sequence. 

The Action also questions this utility, again stating that such a use does "not constitute a specific 
or substantial utility" (Action at page 4). First, Applicants respectfully remind the Examiner that only 
a minor percentage (2-4%) of the genome actually encodes exons, which in turn encode amino acid
sequences. Equally significant is that the claimed polynucleotide sequence defines how the encoded
exons are actually spliced together to produce an active transcript (i.e., the described sequences are
useful for functionally defining exon splice-junctions). It is well-known that exon splice junctions can 
often be hot spots for erroneous events leading to cancer. The claimed sequences identify biologically 
verified exon splice junctions, as opposed to splice junctions that may have been bioinformatically 
predicted from genomic sequence alone. The specification also details that "sequences derived from 
regions adjacent to the intron/exon boundaries of the human gene can be used to design primers for use 
in amplification assays to detect mutations within the exons, introns, splice sites (e.g., splice acceptor
and/or donor sites), etc., that can be used in diagnostics and pharmacogenomics" (specification at page
10, lines 1-7). Applicants respectfully submit that the practical scientific value of biologically validated,
expressed, spliced, and polyadenylated mRNA sequences is readily apparent to those skilled in the
relevant biological and biochemical arts. Second, the Examiner again seems to be confusing the
requirements of a specific utility with a unique utility. The fact that other nucleotide sequences can be
used to identify exon splice junctions and map human chromosome 15 does not mean that these uses
of Applicants' sequence are not specific utilities (Carl Zeiss Stiftung v. Renishaw PLC, supra). Thus,
the present sequence clearly meets the requirements of 35 U.S.C. § 101.

It is important to note that it has been clearly established that a statement of utility in a
specification must be accepted absent reasons why one skilled in the art would have reason to doubt
the objective truth of such statement. In re Langer, 503 F.2d 1380, 1391, 183 USPQ 288, 297
(CCPA, 1974; "Langer"); In re Marzocchi, 439 F.2d 220, 224, 169 USPQ 367, 370 (CCPA,
1971). As clearly set forth in Langer:

As a matter of Patent Office practice, a specification which contains a disclosure of 
utility which corresponds in scope to the subject matter sought to be patented must be 
taken as sufficient to satisfy the utility requirement of § 101 for the entire claimed 
subject matter unless there is a reason for one skilled in the art to question the objective 
truth of the statement of utility or its scope. 

Langer at 297, emphasis in original. As set forth in the MPEP, "Office personnel must provide
evidence sufficient to show that the statement of asserted utility would be considered 'false' by a person
of ordinary skill in the art" (MPEP, Eighth Edition at 2100-40, emphasis added). Absent such evidence
from the Examiner, the present claims clearly meet the requirements of 35 U.S.C. § 101.

The Action goes on to suggest that the claimed sequences lack utility because further research
would be required in certain aspects of the invention. Even if, arguendo, further research might be
required in certain aspects of the present invention, this does not preclude a finding that the invention
has utility, as set forth by the Federal Circuit's holding in In re Brana (34 USPQ2d 1436 (Fed. Cir.
1995), "Brana"), which clearly states that "pharmaceutical inventions, necessarily includes the
expectation of further research and development" (Brana at 1442-1443, emphasis added). In
assessing the question of whether undue experimentation would be required in order to practice the
claimed invention, the key term is "undue", not "experimentation". In re Angstadt and Griffin, 190
USPQ 214 (CCPA 1976). The need for some experimentation does not render the claimed invention
unpatentable. Indeed, a considerable amount of experimentation may be permissible if such
experimentation is routinely practiced in the art. In re Angstadt and Griffin, supra; Amgen, Inc. v.
Chugai Pharmaceutical Co., Ltd., 18 USPQ2d 1016 (Fed. Cir. 1991). As a matter of law, it is well
settled that a patent need not disclose what is well known in the art. In re Wands, 8 USPQ2d 1400
(Fed. Cir. 1988).

Finally, the requirements set forth in the Action for compliance with 35 U.S.C. § 101 do not
comply with the requirements set forth by the Patent and Trademark Office ("the PTO") itself for
compliance with 35 U.S.C. § 101. While Applicants are well aware of the new Utility Guidelines set
forth by the USPTO, Applicants respectfully point out that the current rules and regulations regarding
the examination of patent applications are and always have been the patent laws as set forth in 35 U.S.C.
and the patent rules as set forth in 37 C.F.R., not the Manual of Patent Examining Procedure or
particular guidelines for patent examination set forth by the USPTO. Furthermore, it is the job of the
judiciary, not the USPTO, to interpret these laws and rules. Applicants are unaware of any significant
recent changes in either 35 U.S.C. § 101, or in the interpretation of 35 U.S.C. § 101 by the Supreme
Court or the Federal Circuit that is in keeping with the new Utility Guidelines set forth by the USPTO.
This is underscored by numerous patents that have been issued over the years that claim nucleic acid
fragments that do not comply with the new Utility Guidelines. As examples of such issued U.S. Patents,
the Examiner is invited to review U.S. Patent Nos. 5,817,479, 5,654,173, and 5,552,281 (each of
which claims short polynucleotides), and recently issued U.S. Patent No. 6,340,583 (which includes
no working examples), none of which contain examples of the "real-world" utilities that the Examiner
seems to be requiring. As issued U.S. Patents are presumed to meet all of the requirements for
patentability, including 35 U.S.C. §§ 101 and 112, first paragraph (see Section III, below), Applicants
submit that the present polynucleotides must also meet the requirements of 35 U.S.C. § 101. While
Applicants understand that each application is examined on its own merits, Applicants are unaware of
any changes to 35 U.S.C. § 101, or in the interpretation of 35 U.S.C. § 101 by the Supreme Court
or the Federal Circuit, since the issuance of these patents that render the subject matter claimed in these
patents, which is similar to the subject matter in question in the present application, as suddenly
non-statutory or failing to meet the requirements of 35 U.S.C. § 101. Thus, holding Applicants to a different
standard of utility would be arbitrary and capricious, and, like other clear violations of due process,
cannot stand.

For each of the foregoing reasons, as well as the reasons set forth in the previous response,
Applicants submit that as the presently claimed nucleic acid molecules have been shown to have a
substantial, specific, credible and well-established utility, the rejection of claims 1, 3 and 5-7 under
35 U.S.C. § 101 has been overcome, and request that the rejection be withdrawn.

III. Rejection of Claims 1, 3 and 5-7 Under 35 U.S.C. § 112, First Paragraph

The Action next rejects claims 1, 3 and 5-7 under 35 U.S.C. §112, first paragraph, since 
allegedly one skilled in the art would not know how to use the invention, as the invention allegedly is 
not supported by a specific, substantial, and credible utility or a well-established utility. Applicants 
respectfully traverse. 

Applicants submit that as claims 1, 3 and 5-7 have been shown to have "a specific, substantial,
and credible utility", as detailed in section II above, the present rejection of claims 1, 3 and 5-7 under
35 U.S.C. § 112, first paragraph, cannot stand.

Applicants therefore request that the rejection of claims 1, 3 and 5-7 under 35 U.S.C. § 112,
first paragraph, be withdrawn. 

IV. Conclusion 

The present document is a full and complete response to the Action. In conclusion, Applicants 
submit that, in light of the foregoing remarks, the present case is in condition for allowance, and such 
favorable action is respectfully requested. Should Examiner Snedden have any questions or comments, 
or believe that certain amendments of the claims might serve to improve their clarity, a telephone call
to the undersigned Applicants' representative is earnestly solicited. 

Respectfully submitted, 



August 21, 2003
Date 



David W. Hibler 
Agent for Applicants 



Reg. No. 41,071



LEXICON GENETICS INCORPORATED 
8800 Technology Forest Place 
The Woodlands, TX 77381 
(281) 863-3399 
Exhibit A

Clean Version of The Pending Claims in U.S. Patent Application Ser. No. 09/784,358 



1. (Previously Presented) An isolated nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NO:1.

3. (Original) An isolated nucleic acid molecule comprising a nucleotide sequence that encodes 
the amino acid sequence shown in SEQ ID NO:2. 

5. (Previously Presented) A recombinant expression vector comprising the isolated nucleic 
acid molecule of claim 3. 

6. (Previously Presented) The recombinant expression vector of claim 5, wherein the isolated 
nucleic acid molecule comprises the nucleotide sequence of SEQ ID NO:1.

7. (Previously Presented) A host cell comprising the recombinant expression vector of claim 5. 
Punctin (ADAMTSL-1) is a secreted molecule resembling
members of the ADAMTS family of proteases.
Punctin lacks the pro-metalloprotease and the disinte- 
grin-like domain typical of this family but contains 
other ADAMTS domains in precise order including four 
thrombospondin type I repeats. Punctin is the product 
of a distinct gene on human chromosome 9p21-22 and
mouse chromosome 4 that is expressed in adult skeletal 
muscle. His-tagged punctin expressed in stably trans- 
fected High-Five™ insect cells was purified to apparent 
homogeneity by Ni-chromatography of conditioned me- 
dium. The NH 2 terminus is not blocked and has the 
sequence EEDRD and so forth as determined by Edman 
degradation, demonstrating signal peptidase process- 
ing. Recombinant epitope-tagged punctin has a calcu- 
lated mass of 59,991 Da but exhibits major molecular 
species of 61970 ± 6 Da and 62131 ± 5 Da as measured by 
liquid chromatography electrospray mass spectrome- 
try. Punctin is a glycoprotein based on carbohydrate 
staining and liquid chromatography electrospray mass 
spectrometry glycopeptide analysis. Glycosylation oc- 
curs at a single N-linked site as demonstrated by altered 
electrophoretic migration of punctin expressed in the 
presence of tunicamycin A. Punctin contains disulfide 
bonds based on antibody accessibility and electro- 
phoretic migration under reducing versus nonreducing 
conditions. Rotary shadowing demonstrates that punc- 
tin is hatchet-shaped having a globular region attached 
to a short stem. In transfected COS-1 cells, punctin is 
deposited in the cell substratum in a punctate fashion 
and is excluded from focal contacts. Punctin is the first 
member of a novel family of ADAMTS-like proteins that 
may have important functions in the extracellular 
matrix. 
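
The mass figures in the abstract above imply a gap between the calculated and measured masses. The following is a minimal sketch of that arithmetic only, using the three values stated in the abstract; the variable and function names are hypothetical, and no interpretation beyond the abstract's own statement that punctin is a glycoprotein is assumed.

```python
# Masses in daltons, as stated in the abstract above.
CALCULATED_MASS_DA = 59_991
# (measured mass, reported +/- uncertainty) for the two major species.
OBSERVED_SPECIES_DA = [(61_970, 6), (62_131, 5)]


def mass_shift(observed: int, calculated: int = CALCULATED_MASS_DA) -> int:
    """Excess of a measured mass over the calculated polypeptide mass."""
    return observed - calculated


# Excess mass of each measured species over the calculated value.
shifts = [mass_shift(mass) for mass, _ in OBSERVED_SPECIES_DA]

# Difference between the two species, with a simple worst-case
# uncertainty obtained by adding the two reported error bounds.
species_gap = OBSERVED_SPECIES_DA[1][0] - OBSERVED_SPECIES_DA[0][0]
gap_uncertainty = sum(err for _, err in OBSERVED_SPECIES_DA)

print(f"shifts: {shifts} Da; gap between species: "
      f"{species_gap} +/- {gap_uncertainty} Da")
```

Both measured species are roughly 2 kDa heavier than the calculated mass, which fits the abstract's conclusion that the recombinant protein carries post-translational modification.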
Metalloproteases responsible for extracellular matrix (ECM)1 turnover
have a modular structure. Matrix metalloproteinases
(MMPs) (1), a disintegrin-like and metalloprotease (ADAMs)
(2), and proteases of the ADAMTS family (3, 4) are composed of
characteristic domains arranged in a precise order that is the 
hallmark of each family. These enzymes are structurally and 
functionally bipartite consisting of an enzymatic domain at- 
tached to nonenzymatic or ancillary domains. The ancillary 
domains localize these proteases to substrates, the cell surface, 
or to the ECM. The ancillary domains of the gelatinases
MMP-2 and MMP-9 are among the best studied of the sub- 
strate-binding domains. The fibronectin type II domains of the 
gelatinases are involved in binding to gelatin and some colla- 
gens as well as to fibronectin and heparin as in the case of 
MMP-2 (5, 6). The gelatin-binding domain of MMP-2 binds the 
matricellular proteins thrombospondin-1 (TSP1) and TSP2 (7).
Although neither is a substrate for MMP-2, the interaction may 
mediate the clearance of MMP-2 and affect cell-adhesive prop- 
erties (8). The MMP-2 hemopexin domain interacts with the 
carboxyl terminus of the tissue inhibitor of metalloproteases-2, 
facilitating pro-MMP-2 activation by membrane-type MMPs (1,
5, 6, 9). The MMP-2 hemopexin domain also interacts with a
chemokine called monocyte chemoattractant protein-3, which
allows its processing by the catalytic domain (10). The disintegrin
domains of ADAMs such as ADAM-15 are implicated in
cell-cell adhesion (2, 11, 12), and the ancillary domains of
ADAMTS-1 are required for its binding to the ECM (13). In
some ADAMs, the zinc-binding active site is nonfunctional, 
suggesting that they do not function as proteases at all but may 
instead have a primary role in adhesion via their ancillary 
domains (2).

With this background, it is conceptually possible that gene 
products containing only the ancillary domains of ADAMTS 
may have specific functions in cell-cell or cell-matrix interac- 
tions or may regulate ADAMTS proteases. We have identified 
an ADAMTS-like (ADAMTSL) molecule named punctin,2



1 The abbreviations used are: ECM, extracellular matrix; ADAMTSL,
a disintegrin-like and metalloprotease domain with thrombospondin
type I motifs like; ADAMTS, a disintegrin-like and metalloprotease
domain with thrombospondin type I motifs; ADAM, a disintegrin-like
and metalloprotease; MS, mass spectrometry; EST, expressed sequence
tag; LC-ESMS, liquid chromatography-electrospray mass spectrometry;
MALDI-TOF, matrix-assisted laser desorption ionization time-of-flight;
MMP, matrix metalloprotease; ORF, open reading frame; PBS,
phosphate-buffered saline; RACE, rapid amplification of cDNA ends; TSP,
thrombospondin; TS, thrombospondin type I domain; HexNAc,
N-acetylhexosamine; NeuAc, N-acetylneuraminic acid.

2 Approved gene symbols ADAMTSL1 and Adamtsl1 indicate human
and mouse orthologs, respectively. The corresponding protein
product of these genes, ADAMTSL-1, is designated by the trivial name
punctin because of its punctate distribution beneath transfected cells.
which is the product of a gene distinct from any in the ADAMTS
family and is composed of ADAMTS ancillary domains
alone. We have purified and characterized recombinant punctin
produced in insect cells, visualized it by electron microscopy,
and demonstrated that it is a glycoprotein and a component
of the ECM.

EXPERIMENTAL PROCEDURES 

cDNA Cloning and Sequence Analysis — Using BLAST programs
from the National Center for Biotechnology Information, we scanned
the data base of ESTs using the protein sequences of ADAMTS proteases
previously cloned by us (4, 14) and identified a human EST
(GenBank™ accession number AA482392) encoded by IMAGE clone
752797. The EST predicted a polypeptide with a similarity to the
carboxyl half of cognate ADAMTS members but with no identities in 
GenBank™ or other protein and nucleotide data bases. 

Using nested oligonucleotide primers based on the sequences at the 
5' and 3' ends of the IMAGE clone insert and human skeletal muscle 
cDNA (Marathon cDNA, CLONTECH. Palo Alto, CA) as the template, 
we performed RACE and extended the cDNA at 5' and 3' ends by PCR 
essentially as described previously (4, 14).

Northern Blot Analysis — Multiple tissue Northern blots from adult
human and mouse tissues (CLONTECH, Palo Alto, CA) were hybridized
to a [α-32P]dCTP-labeled punctin probe, a 1200-bp cDNA fragment
from the 5' end of the punctin coding sequence, followed by autoradio- 
graphic exposure for 7 days. 

Chromosomal Mapping and Genomic Arrangement — To determine 
the chromosomal location of Adamtsl1, we analyzed a panel of DNA
samples from an interspecific cross that has been characterized for over 
1200 genetic markers throughout the mouse genome (15). Markers can 
be seen on the worldwide web (www.informatics.jax.org/searches/crossdata_form.shtml)
by entering "DNA Mapping Panel Data Sets" from the
mouse genome data base and then selecting the "Seldin cross" and
"Chromosome." Initially, DNA from the two parental mice, (C3H/HeJ-gld)
and [(C3H/HeJ-gld × Mus spretus) F1], were digested with various
restriction endonucleases and hybridized with the Adamtsl1 cDNA
probe (IMAGE clone 2076907 with GenBank™ accession number 
AI787975) to determine restriction fragment length variants for haplo- 
type analyses. Gene linkage was determined by segregation analysis. 
Gene order was determined by analyzing all haplotypes and minimizing 
crossover frequency among all genes that were determined to be within 
a linkage group. This method resulted in the determination of the most 
probable gene order. To define the locus for ADAMTSL1, the human
punctin cDNA sequence was used for BLAST searches of the human 
genome (Celera Sciences, Rockville, MD). 

Generation and Characterization of Anti-punctin Antisera — The peptide
(NH2)-[C]YYPENIKPKPKLQE-(OH) located in the third TS domain
of punctin (Fig. 1B) was synthesized using Fmoc (N-(9-fluorenyl)methoxycarbonyl)
chemistry, purified by reverse-phase high-pressure
liquid chromatography, and molecular weight was confirmed by MS
(Alpha Diagnostic International, San Antonio, TX). A cysteine ([C])
residue was included at the NH2 terminus for coupling to keyhole
limpet hemocyanin. Peptide-keyhole limpet hemocyanin conjugate was
dialyzed in PBS and used for immunization. Two New Zealand White
male rabbits (7-8 pounds) were immunized with the conjugate (~200
μg/injection/rabbit, multiple intramuscular and subcutaneous sites) at
biweekly intervals for 8 weeks. After an initial injection in Freund's
complete adjuvant, subsequent injections were given in incomplete 
adjuvant. Antibody titer was measured by enzyme-linked immunosor- 
bent assay using free peptide. 

Immune sera were tested by Western blot analysis of extracts from 
COS-1 cells transiently transfected with punctin cDNA (see below). 
Although antisera from both rabbits (antisera 4112 and 4113) gave 
qualitatively similar results, the best signal/noise ratio was obtained 
with antiserum 4113. Affinity-purified antibodies were prepared by 
column chromatography of antiserum 4113 using the immobilized pep- 
tide immunogen. 

Expression and Purification of Recombinant Punctin from Insect 
Cells — High-Five™ cells (Invitrogen) were routinely cultured on tissue 
culture plastic and maintained at 27 °C in Ultimate™ serum-free insect
cell medium (Invitrogen) as per manufacturer's directions. The
full-length punctin ORF was excised from pcDNA3.1/Myc-His B-TSL1
(see below) with EcoRI and NotI and ligated into the corresponding sites
in pIZT/V5-His (Invitrogen). The resulting insect cell expression plasmid
pIZT/V5-His-TSL1 generated punctin with a COOH-terminal V5
epitope and 6x His tag. pIZT/V5-His-TSL1 was transfected into High-Five™
cells using Insectin-Plus liposomes (Invitrogen) and plated onto



100-mm Petri dishes. After 48 h, antibiotic selection -500 ^gml Zeocm, 
Invitrogen) was started and continued for 21 days. Colonies that sur- 
vived selection were picked manually, expanded, and maintained in 
medium containing Zeocin 1 50 ^g/ml 1 . Punctin production by isolated 
colonies was tested by Western blot analysis of conditioned medium 
using anti-His monoclonal antibody (Invitrogen 1 and antibody 4113 

For protein production, cells were grown in suspension in either
Ultimate™ serum-free insect cell medium or Express-Five serum-free
medium containing heparin (5 units/ml, Invitrogen). Production
cultures were in spinner flasks, and culture medium was stored at
-80 °C with 1 mM phenylmethylsulfonyl fluoride until use. For
purification, medium was dialyzed into binding buffer (20 mM sodium
phosphate, 500 mM NaCl, pH 7.8) containing 0.03% Brij-35 (Sigma).
Purification was performed using 1-liter batches of dialyzed medium and
a 5-ml Ni-Sepharose column (ProBond™, Invitrogen) on a fast protein
liquid chromatography instrument (Bio-Rad, Hercules, CA). Following
binding, the column was washed with three column volumes of binding
buffer. A gradient of 0-42.5 mM imidazole in binding buffer was used to
remove nonspecifically bound molecules from the column. Elution was
with four column volumes of 250 mM imidazole in binding buffer, pH
7.0, containing 0.03% Brij-35. Elution was monitored by in-line UV and
conductivity measurements. 2-ml fractions of eluate were collected and
tested by Western blot analysis as described above. Fractions
containing punctin were pooled. Protein concentration was determined
using the Bradford assay (Bio-Rad) and by phenylthiocarbamyl amino
acid analysis using an Applied Biosystems model 420H/130/920 automated
analysis system (16).

Characterization of Recombinant Punctin — The NH2-terminal
sequence of recombinant punctin was determined by Edman degradation.
Recombinant punctin (5 µg) was electrophoresed on 10% SDS-PAGE,
electrotransferred to polyvinylidene difluoride membrane, and lightly
stained with modified Coomassie Blue (Simply Blue Safe Stain,
Invitrogen). Protein bands were excised and subjected to Edman
degradation on an Applied Biosystems Procise 492 sequencer in the
Molecular Biotechnology Core Facility of the Lerner Research Institute.

To probe for glycosylation, recombinant punctin (4 µg) was
electrophoresed on 10% SDS-PAGE and stained for carbohydrate using a
periodic acid-Schiff reaction-based method (Pro-Q fuchsia glycoprotein
staining kit, Molecular Probes, Eugene, OR). In this reaction,
CandyCane™ glycoprotein molecular weight standards consisting of
alternate bands of glycosylated and unglycosylated proteins were used
as controls. Glycoprotein staining was also performed after enzymatic
deglycosylation of punctin with peptide N-glycosidase F.
Deglycosylation of denatured as well as native punctin was performed
with a commercially available kit (Bio-Rad) using bovine fetuin as a
control. To investigate further whether N-linked carbohydrates were
present in punctin, stably transfected insect cells were cultured in
the presence or absence of tunicamycin A1 homolog (0.1 µg/ml culture
medium, Sigma). Equal amounts of total protein from culture medium of
tunicamycin-treated and untreated cells were assayed by Western blot
with antibody 4113 at various time points after the addition of
tunicamycin.

Mass Spectrometry — The molecular mass of punctin was measured
by MALDI-TOF and by LC-ESMS. MALDI-TOF was performed with a
PerkinElmer Biosystems Voyager DE Pro mass spectrometer using
sinapinic acid as the matrix and bovine serum albumin as a calibration
standard protein (17). MALDI-TOF MS measurements of intact punctin
and naturally observed limited proteolysis fragments are reported
± 50% peak width (in Da) at half-maximal peak height. LC-ESMS was
performed with a PerkinElmer Sciex API 3000 triple quadrupole mass
spectrometer (17, 18). Nitrogen was used as the nebulization gas at
40 p.s.i., and curtain gas was supplied from a nitrogen generator
(Whatman model 75-72). For LC-ESMS of intact punctin, a scan range of
700-1800 m/z was used with 0.2 atomic mass unit steps, a scan time of
7.5 s, an orifice potential of 80 V, and an ion spray voltage of
5000 V. Reverse phase-high-pressure liquid chromatography was done at a
flow rate of 5 µl/min on a 5-µm Vydac C18 capillary column (0.3 x 150
mm, LC Packings) using an Applied Biosystems Model 140D high-pressure
liquid chromatography system and aqueous acetonitrile/trifluoroacetic
acid solvents with 100% of the eluant going to the mass spectrometer.
ESMS measurements of intact punctin are reported as the mean ± S.E.
(in Da).
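
The scan window quoted above constrains which charge states of the intact protein can be observed. As a rough illustration only, the sketch below lists the [M + zH]z+ ions that fall inside a 700-1800 m/z window. The neutral mass of 61,970 Da is taken from the Results as an assumed input, and the function names are hypothetical; this is not the instrument vendor's deconvolution routine.

```python
PROTON_MASS = 1.00728  # Da, mass of the proton that carries each positive charge


def mz(neutral_mass, z):
    """m/z of an [M + zH]^z+ ion for a given neutral mass and charge."""
    return (neutral_mass + z * PROTON_MASS) / z


def charge_states_in_window(neutral_mass, mz_low, mz_high):
    """Return every charge state whose m/z lies inside the scan window."""
    z_limit = int(neutral_mass // mz_low) + 1  # no higher charge can reach mz_low
    return [z for z in range(1, z_limit + 1)
            if mz_low <= mz(neutral_mass, z) <= mz_high]


def neutral_mass_from_mz(observed_mz, z):
    """Invert mz(): recover the neutral mass from one charge-state peak."""
    return z * (observed_mz - PROTON_MASS)


# Assumed input: the ~61,970 Da species reported for tagged punctin.
states = charge_states_in_window(61970.0, 700.0, 1800.0)
print(states[0], states[-1])
```

Each observed peak gives an independent estimate of the neutral mass through `neutral_mass_from_mz`, which is one way a mean ± S.E. over charge states can arise.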

For glycopeptide characterization, punctin was excised from a
SDS-polyacrylamide gel (~1 µg/lane x 6 lanes), in-gel reduced with
10 mM dithiothreitol, cysteine-alkylated with 20 mM iodoacetamide in
400 mM ammonium bicarbonate, and digested with 0.2 µg of trypsin
(Promega) overnight at 37 °C in 100 mM ammonium bicarbonate. Peptides
from the in-gel tryptic digests were extracted with 60% acetonitrile
containing 0.1% trifluoroacetic acid, dried in a Speed Vac, redissolved
in 50 µl of 0.1% trifluoroacetic acid, and analyzed by LC-ESMS using
selective ion monitoring with the PE Sciex API 3000 triple quadrupole
mass spectrometer system as described above for intact protein
analyses. Glycopeptides were selectively detected based on diagnostic
sugar oxonium ions HexNAc-Hex+ (m/z 366) and N-acetylneuraminic acid
NeuAc+ (m/z 292) (17). Carbohydrate marker ions at m/z 366 and 292
(dwell time 200 ms each) were monitored in a positive ion mode at a
high orifice potential (180 V), whereas full scans at m/z 300-2300 (0.2
atomic mass unit steps, scan time 3.5 s) were acquired at a lower
orifice potential (70 V). This way both intact parent ions and abundant
marker ions were observed in the same m/z scan.

Rotary Shadowing and Electron Microscopy of Recombinant
Punctin — Rotary shadowing was done essentially as described previously
(19). A 30-µl sample of punctin at 100 ng/ml was mixed with 70 µl of
glycerol and nebulized onto freshly cleaved mica using an airbrush. The
sample was dried in a vacuum, and rotary shadowed using a
platinum-carbon electron beam gun angled at 6° relative to the mica
surface within a Balzers BAE 250 evaporator. The replica was backed
with carbon, floated onto distilled water, and picked up onto 600 mesh
grids. Photomicrographs were taken using a Philips 410 electron
microscope operated at 80 kV.

Transient Expression of Tagged and Untagged Punctin in COS-1
Cells — An internal SacI site and a flanking NotI site were used to
remove a 1.5-kb fragment of IMAGE clone 752797 and ligate it into
corresponding sites in IMAGE clone 2150669 corresponding to the 5'
end of the punctin cDNA to generate a complete ORF. EcoRI and NotI
sites flanking this ORF were used to excise and clone the full-length
coding sequence into pcDNA3.1/Myc-His (+) A (Invitrogen) for the
expression of untagged punctin. To make constructs in which the
ADAMTSL1 ORF was in-frame with a carboxyl-terminal FLAG tag or a
tandem myc tag and 6× His tag, PCR was performed with Advantage 2
polymerase (CLONTECH, Palo Alto, CA) using the full-length coding
sequence as a template. The amplicons were cloned into the vectors
pFLAG-CMV5c (Sigma) and pcDNA3.1/Myc-His B (Invitrogen) for
expression with either a COOH-terminal FLAG tag or a COOH-terminal
tandem myc tag and 6× His tag, respectively.

COS-1 cells (ATCC number CRL-1650) were grown on tissue culture
plastic in Dulbecco's modified Eagle's medium:F-12 (1:1) (Lerner
Research Institute Media Services) supplemented with 10% fetal bovine
serum (Invitrogen) and antibiotics (100 units/ml of penicillin and 50
µg/ml streptomycin). 10^5 cells between passages 3 and 10 were
transfected with untagged, FLAG-tagged, or myc + 6× His-tagged punctin
using FuGENE 6 (Roche Molecular Biochemicals) as per the
manufacturer's recommendations, and cells were grown for an additional
24-48 h in serum-supplemented or serum-free medium. As a control, cells
were transfected with the respective vector alone without insert. The
medium was collected and concentrated 10-fold. Cells were harvested
after detachment with 10 mM EDTA for 10-15 min at 37 °C. A complete
detachment of cells was confirmed by phase-contrast microscopy. Fifty
microliters of 2× Laemmli sample buffer was added to the wells, and
the ECM was scraped off. Samples of cell lysate, medium, and ECM
were separately electrophoresed under reducing conditions (samples
were boiled following the addition of 10% (v/v) 2-mercaptoethanol) on
12% SDS-polyacrylamide gels and transferred to enhanced
chemiluminescence (ECL)-Hybond (Amersham Biosciences, Inc.). Western
blotting was performed using either anti-FLAG M2 antibody (diluted
1:500, Sigma), anti-His (COOH-terminal) antibody (diluted 1:1000,
Invitrogen) or antibody 4113 (diluted 1:300) depending on the construct
used for transfection. Antibody binding was detected using the
appropriate peroxidase-labeled second antibody followed by ECL using
reagents from Amersham Biosciences, Inc.

For immunocytochemistry, COS-1 cells were grown on glass
coverslips in 35-mm diameter wells (in 6-well plates) and transiently
transfected as described above in serum-supplemented or serum-free
medium. The medium was removed 48 h after transfections. The cells were
washed three times on ice with cold PBS containing 1 mM CaCl2 and 1
mM MgCl2 and incubated for 1 h on ice with 1 ml of culture medium
containing anti-FLAG M2 monoclonal antibody (diluted 1:300, Sigma)
or anti-punctin rabbit antisera (diluted 1:100) with gentle shaking.
Cells were washed four times for 3 min each with cold PBS, fixed in 4%
paraformaldehyde (w/v in PBS) (Sigma) on ice for 30 min with gentle
shaking and then washed three times with PBS at ambient temperature.
To quench free aldehyde groups, cells were treated with 75 mM
ammonium chloride, 20 mM glycine for 10 min at ambient temperature,
washed with PBS, and then blocked with 0.05% Triton X-100, 2%
normal goat serum in PBS (10 min at ambient temperature). Finally,
sections were incubated with the species-appropriate Texas Red-labeled
goat secondary antibody (Jackson ImmunoResearch Laboratories, West
Grove, PA) prior to coverslip mounting in Vectashield containing
4',6-diamidino-2-phenylindole (Vector Laboratories, Inc., Burlingame,
CA). The following control-immunostaining experiments were performed.
COS-1 cells transfected with the vector alone or untransfected COS-1
cells were stained with the above antibodies, or transfected cells were
stained with preimmune serum from the rabbits in which the polyclonal
antibodies were produced.

To co-stain punctin and the actin cytoskeleton, cells were stained
with anti-FLAG or anti-punctin antibodies as described above with the
exception that incubation with the secondary antibodies included Alexa
488-phalloidin at recommended dilutions (Molecular Probes). In double
immunostaining experiments following the immunolocalization of
FLAG or punctin as described above, cells were permeabilized with
0.1% Triton X-100 in PBS for 20 min prior to staining with (a)
monoclonal antibody to vinculin (1:100 dilution, Sigma) in combination
with antiserum 4113 for the detection of punctin or (b) polyclonal
antibody to focal adhesion kinase (1:200 dilution, Upstate
Biotechnology, Lake Placid, NY) in combination with anti-FLAG
monoclonal antibody M2 (Sigma) for the detection of punctin. A Texas
Red-labeled antibody (Jackson ImmunoResearch Laboratories) was used for
the detection of punctin, and Alexa 488-conjugated antibody (Molecular
Probes) was used for the detection of vinculin or focal adhesion
kinase.

RESULTS 

Cloning of Punctin cDNA — We identified a novel EST (GenBank™
accession number AA482392) derived from pooled human melanocyte,
fetal heart, and pregnant uterus with homology to ADAMTS proteases.
The 1.5-kb insert of the corresponding IMAGE clone 752797 contained a
long ORF encoding an amino-terminal TS domain, a cysteine-rich domain,
a cysteine-free spacer domain, and three tandem TS modules followed by
a short acidic peptide and stop codon (Fig. 1a). The stop codon and
3'-untranslated sequence were independently confirmed by 3'-RACE
(clone pSHTSLls3, Fig. 1a) as well as by another EST (GenBank™
accession number W47029). The 3'-untranslated region encoded in IMAGE
clone 752797 contained a consensus polyadenylation signal (AATTAAA)
followed by a poly(A) tail 14 nucleotides downstream. Completion of the
full-length coding sequences by 5'-RACE predicted a putative signal
peptide upstream of the central TS domain. The signal peptide was
preceded by a methionine codon within a satisfactory Kozak consensus
sequence (A at -3, G at +4 relative to ATG) (20) although there was no
upstream in-frame stop codon. The 5' sequence obtained by RACE was
subsequently validated by independently cloned human and mouse ESTs
(GenBank™ accession numbers AI459225 for human EST and AK020115 for
mouse EST). The continuity of the cDNA clones was confirmed by PCR
amplification of the full-length punctin ORF from human skeletal
muscle cDNA (see below) as well as by identification of the encoding
exons arranged sequentially on human chromosome 9 (Celera Genomics,
Rockville, MD).

Primary Structure of Punctin Predicts an ADAMTS-like Protein —
The predicted full-length punctin protein contains 525 amino acids
and has the typical domain structure of the ancillary noncatalytic
regions of an ADAMTS protease (Fig. 1a). The mature secreted form of
punctin is 497 amino acids with a molecular mass of 55,240 Da and a
calculated pI of 6.2. Like the ADAMTS proteases, each domain in
punctin has an even number of cysteine residues. This observation
suggests that each domain may have internal disulfide bonds (17 such
bonds are predicted in punctin), and that punctin consists of a series
of independently folded and disulfide-bonded domains. Punctin contains
no other domains apart from those described previously in the ADAMTS
family. The punctin sequence contains one motif for N-linked
glycosylation (21) at Asn" 1 (-Asn-X-Ser/Thr-, where X is any amino
acid except Pro) and also contains a total of 75 Thr and Ser residues,
where O-linked glycosylation might occur (Fig. 1b).
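
The N-linked glycosylation motif quoted above (Asn-X-Ser/Thr, with X being any residue except Pro) translates directly into a pattern search. The following is a minimal sketch, assuming a protein sequence in single-letter code; the function names and the short test sequence are hypothetical and are not the punctin sequence.

```python
import re

# Asn, then any residue except Pro, then Ser or Thr.
# The lookahead lets overlapping sequons (e.g. "NNTS") all be reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_n_glyc_sites(protein):
    """Return 1-based positions of Asn residues in an Asn-X-Ser/Thr motif."""
    return [m.start() + 1 for m in SEQUON.finditer(protein)]


def count_ser_thr(protein):
    """Count Ser and Thr residues, the candidate sites for O-linked sugars."""
    return protein.count("S") + protein.count("T")


# Hypothetical example sequence, for illustration only.
example = "MKNGSAANPTLLNNTS"
print(find_n_glyc_sites(example))  # Asn-Pro-Thr at position 8 is skipped
print(count_ser_thr(example))
```

A motif hit only marks a potential site; whether it is occupied has to be established experimentally, as the glycosylation experiments in this paper do.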

Fig. 1. a, domain organization of punctin/ADAMTSL-1 shown relative
to ADAMTS-1, the prototypic ADAMTS. The cloning strategy used
for determination of the complete primary structure is shown. The
location of each cDNA clone relative to the protein domains indicates
the regions it encodes. The key to the domains is shown at the bottom
of the figure. b, the predicted amino acid sequence of punctin is shown
using the single-letter amino acid code. TS modules are underlined with
the thick line and are numbered sequentially from amino to carboxyl
terminus. A consensus sequence for N-linked glycosylation is overlined.
Cysteine residues are indicated by asterisks. The start of the spacer
domain is indicated; the region between the NH2-terminal TS domain
and the spacer domain is the cysteine-rich domain. The dashed line
indicates the peptide used for the generation of antibodies. The arrow
indicates the signal peptidase cleavage site. The arrowhead indicates a
putative proteolytic processing site between TS domains 2 and 3. c,
segregation of Adamtsl1 on mouse chromosome 4 in [(C3H/HeJ-gld x
M. spretus) F1 x C3H/HeJ-gld] interspecific backcross mice. Filled
boxes represent the homozygous C3H pattern, and open boxes represent
the F1 pattern. The mapping of the reference loci in this interspecific
cross has been previously described (15).

The overall punctin sequence is most similar to human ADAMTSL-3
(68% identity, see below). Of the ADAMTS enzymes published to date,
punctin is most similar to human ADAMTS-10 (35% identity). The
punctin TS domains have a higher degree of similarity to other
ADAMTS-like proteins and ADAMTS proteases than to TSP1 and TSP2. The
greatest similarities, as indicated by percentage of identity of amino
acid sequences identified by BLAST searches of the first TS domain of
punctin to TS domains from various molecules, are as follows: human
ADAMTSL-3, 80%; human ADAMTS-1, ADAMTS-6, and ADAMTS-10, 50%; mouse
papilin, 47%; human ADAMTS-8, 44%; human ADAMTS-5, 42%; human TSP2,
40%; human TSP1, 38%. Like most TS domains in the ADAMTS family,
punctin TS domains do not contain linear peptide sequences found in
TSP1 that have been defined as heparin or CD-36 binding sequences
(22). They do not contain degenerate GAG binding sequences such as
BBXB, where B is a basic amino acid and X is any amino acid (22).
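
The BBXB pattern mentioned above can be screened for in the same way as any short linear motif. The sketch below is illustrative only: it assumes that "basic" means Lys or Arg (whether His should count is a choice left to the `basic` parameter), and the function name is hypothetical.

```python
import re


def find_bbxb(protein, basic="KR"):
    """Return (1-based position, matched tetrapeptide) for each BBXB motif.

    B is any residue listed in `basic`; X is any residue. A lookahead is
    used so that overlapping motifs are all reported.
    """
    pattern = re.compile(rf"(?=([{basic}]{{2}}.[{basic}]))")
    return [(m.start() + 1, m.group(1)) for m in pattern.finditer(protein)]


# The immunogen peptide from the third TS domain (without the added Cys)
# has alternating Lys-Pro residues but no two adjacent basic residues.
print(find_bbxb("YYPENIKPKPKLQE"))
# Hypothetical sequence containing one motif, for comparison.
print(find_bbxb("AAKRGKAA"))
```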

Genomic Location of the Mouse and Human Punctin Genes
and Tissue-specific Expression — The mapping of Adamtsl1 in
an interspecific cross resulted in the following most probable
gene order (mean ± S.D.): Ptprd-4.4 ± 2.0 centimorgan-Adamtsl1,
Cdkn2a-1.8 ± 1.2 centimorgan-Jun and placed Adamtsl1 at a consensus
position of 42.6 centimorgan on mouse chromosome 4 (Fig. 1c) in the
vicinity of the interferon gene cluster. A search of the mouse genome
data base (www.informatics.jax.org) did not reveal any pertinent
genetic disorders near this locus.

The human-mouse homology maps (www3.ncbi.nlm.nih.gov/
Omim/Homology/, accessed September 26, 2001) predict that
the ADAMTSL1 locus is on human chromosome 9p21-22. The
predicted locus was confirmed by the analysis of the human
genome sequence. The punctin ORF is encoded by 13 exons
spanning >250 kb of genomic DNA mapping to 9p21.2-22.1. A
search of the Online Mendelian Inheritance in Man site
(www3.ncbi.nlm.nih.gov/Omim/) revealed three unsolved human
disorders in the vicinity of the ADAMTSL1 locus. Diaphyseal
medullary stenosis with malignant fibrous histiocytoma
(MIM 112250) is linked to 9p22-p21, Friedreich's ataxia 2
(MIM 601992) is linked to 9p23-p11, and neuropathy, distal
hereditary motor, Jerash type (MIM 605726) is linked to
9p21.1-p12.

ADAMTSL1 is primarily expressed in human and mouse
skeletal muscle with a major message size of ~7.0 kb in both
species (Fig. 2). A minor messenger RNA species of ~1.0 kb was
also seen in some human tissues (Fig. 2, skeletal muscle, heart,
colon, kidney, and liver). Expression was not detected in brain,
colon, thymus, spleen, placenta, small intestine, lung, testis,
ovary, or peripheral blood leukocytes.

Expression and Characterization of Recombinant Punctin —
Punctin expressed in High-Five™ cells with tandem COOH-terminal
V5 and 6× His epitopes was secreted into the conditioned medium
of adherent as well as suspension cultures. Punctin was detected by
antibody 4113 and anti-epitope tag antibodies as a ~60-kDa band under
reducing conditions. It was substantially purified from the culture
medium using Ni-chromatography (Fig. 3a). The purification scheme
yielded a maximum of 200 µg/liter purified protein as determined by
amino acid analysis. Electrophoresis and Western blotting of
concentrated punctin preparations frequently demonstrated
additional bands of molecular mass (~120 and ~180 kDa, data
not shown), suggesting the formation of dimers and trimers at
high concentrations.

The conformation of punctin appears to be maintained by
disulfide bonds as evidenced by more rapid migration in SDS-PAGE
under nonreducing conditions than under reducing conditions
(Fig. 3b). Furthermore, on Western blots under nonreducing
conditions, the protein was not detectable with antibody
4113 (data not shown), suggesting that the peptide epitope was
not accessible without reduction of disulfide bonds. A mass
analysis of His-tagged punctin by MALDI-TOF MS yielded a
broad peak suggesting that the 60-kDa gel band contained
major molecular species of 61,935 ± 595 and 60,873 ± 295 Da,
respectively. LC-ESMS analyses of the intact protein defined
more precisely the major molecular species to be 61,970 ± 6
and 62,131 ± 5 Da, which are, respectively, 1979 and 2140 Da
larger than the calculated mass (59,991 Da) of tagged punctin
based on amino acid sequence. NH2-terminal sequencing of the
polyvinylidene difluoride-immobilized 60-kDa protein revealed
a single sequence, which commenced at Glu29 (i.e. Glu-Glu-
Asp-Arg-Asp-Gly and so on).
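
The arithmetic behind the mass comparison above is simple enough to check directly. The short sketch below recomputes the excess of each LC-ESMS mass over the calculated mass and expresses it as a percentage of the observed mass. The variable and function names are illustrative, and reading the whole excess as post-translational modification is an interpretation rather than something the numbers prove on their own.

```python
CALCULATED_MASS = 59991.0            # Da, tagged punctin from amino acid sequence
OBSERVED_MASSES = [61970.0, 62131.0]  # Da, major species observed by LC-ESMS


def mass_excess(observed, calculated):
    """Difference between an observed mass and the sequence-derived mass."""
    return observed - calculated


def percent_excess(observed, calculated):
    """Excess mass as a percentage of the observed mass."""
    return 100.0 * (observed - calculated) / observed


for observed in OBSERVED_MASSES:
    extra = mass_excess(observed, CALCULATED_MASS)
    share = percent_excess(observed, CALCULATED_MASS)
    print(f"{observed:.0f} Da: +{extra:.0f} Da ({share:.1f}% of observed mass)")
```

Both species carry an excess of roughly 3-3.5% of their mass, which is the range one would expect if the extra mass were carbohydrate on a lightly glycosylated protein.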

Fig. 2. Northern analysis of expression of ADAMTSL1 (left) and
Adamtsl1 (right) in adult human and mouse tissues, respectively.
Kilobase markers of RNA are shown at the left of each autoradiogram,
and tissue origin is indicated above each lane. Hybridizing transcripts
are indicated by arrows.

Fig. 3. Analysis of epitope-tagged punctin purified by
Ni-chromatography from insect cell culture medium. a, Coomassie Blue
(Simply Blue Safe Stain) staining of purified recombinant punctin on
reducing SDS-PAGE (left lane) and Western blot analysis with
anti-punctin antibody 4113 (right lane). b, Western blot analysis using
anti-His tag monoclonal antibody on reducing (left lane) and
nonreducing SDS-PAGE (right lane). c, glycoprotein staining of
recombinant punctin (lane 2 contains 0.6 µg, and lane 3 contains 3 µg)
using the periodic acid-Schiff procedure. Glycosylated CandyCane™
markers (1 µg/band) stained similarly are in lane 1. The arrow
indicates stained punctin. d, Western analysis of culture medium from
insect cell cultures treated without (left lane) or with (right lane)
tunicamycin A for 72 h. Each lane contains 2.8 µg of total protein.
Double arrowheads are used to indicate two molecular species seen on
Western blots.

Recombinant Punctin Is Glycosylated — Two closely spaced
punctin bands were resolved by Western blot analysis of
conditioned medium or purified protein, although Coomassie Blue
staining of purified punctin always demonstrated a single band
(Fig. 3a). A periodic acid-Schiff-based method of staining
carbohydrate chains suggested that recombinant punctin is a
glycoprotein (Fig. 3c), and mass spectrometry demonstrated
multiple molecular species consistent with variable glycosylation.
Treatment of recombinant protein with peptide N-glycosidase
F did not result in a perceptible decrease in molecular mass,
although the intensity of glycoprotein staining was decreased
(data not shown). Culture medium from tunicamycin-treated
cells exhibited only a single punctin species as demonstrated by
Western blotting (Fig. 3d). The difference (161 Da) between the
LC-ESMS-observed masses of the major punctin molecular species
(61,970 and 62,131 Da) is close to the in-chain chemical
average mass of an oligosaccharide residue (Hex, 162). Minor
molecular species were also apparent by LC-ESMS analysis,
which differed by mass increments that approximated the
in-chain chemical average mass of oligosaccharide residues (e.g.
Hex, 162; HexNAc, 203; NeuAc, 291). For a further analysis,
tryptic digests of the protein were examined by analytical
LC-ESMS using stepped collision energy scanning to produce
carbohydrate-specific marker ions. Glycopeptides were detected
including molecular species with masses of 5881.4 ± 0.4 and
6171.2 ± 0.2 Da. The mass difference (289.8 Da) between these
observed glycopeptides appears to correspond to the in-chain
chemical average mass of N-acetylneuraminic acid (NeuAc,
291). Taken together, these data indicated that punctin is
glycosylated, although specific glycopeptides have yet to be
characterized fully. Approximately 65% of the amino acid
sequence in punctin was identified by peptide mass mapping
including the NH2-terminal tryptic peptide (Glu^'-Arg 4, ),
verifying that the target protein has been expressed. Based on the
difference between the observed and calculated masses of
intact punctin, the recombinant protein contains approximately
3-4% carbohydrate by weight.
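
The reasoning used above, matching a mass difference between two species to the in-chain mass of a sugar residue, can be written as a small lookup. This is a sketch under stated assumptions: the residue masses are the nominal values quoted in the text, the 2-Da tolerance is an arbitrary illustrative choice, and the function name is hypothetical.

```python
# In-chain chemical average masses (Da) as quoted in the text.
RESIDUE_MASSES = {"Hex": 162.0, "HexNAc": 203.0, "NeuAc": 291.0}


def closest_residue(delta, tolerance=2.0):
    """Name the sugar residue whose mass is nearest to `delta`.

    Returns None when no residue lies within `tolerance` Da, so that an
    unrelated mass difference is not forced onto the nearest sugar.
    """
    name, mass = min(RESIDUE_MASSES.items(), key=lambda kv: abs(kv[1] - delta))
    return name if abs(mass - delta) <= tolerance else None


# Difference between the two major intact species (LC-ESMS).
print(closest_residue(62131.0 - 61970.0))
# Difference between the two observed glycopeptides.
print(closest_residue(6171.2 - 5881.4))
```

A match of this kind is consistent with the two species differing by one sugar residue, but it does not identify the attachment site, which is why the text notes that the glycopeptides remain to be characterized fully.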

During purification of punctin in the absence of protease
inhibitors, additional components of ~40 and ~20 kDa,
respectively, were detected on Coomassie Blue-stained gels (data not
shown). The 40-kDa band contained two molecular species with
measured masses of 38,409 ± 115 and 39,456 ± 156 Da,
respectively, as determined by MALDI-TOF MS. The NH2-terminal
sequencing of these bands yielded the same amino terminus
as the full-length punctin. The ~20-kDa fragment
exhibited an NH2-terminal sequence 372DLYHPL, indicating
that the fragment is from the carboxyl terminus. The addition
of 1 mM phenylmethylsulfonyl fluoride to culture medium
effectively prevented this proteolysis, suggesting that it was
effected by a serine protease.

Visualization of Punctin by Rotary Shadowing — Rotary
shadowing of purified recombinant punctin demonstrated a
hatchet-shaped or comma-shaped molecule 30-40 nm in
length (Fig. 4). Punctin consists of a single globular domain of
10-20 nm in size with a short linear segment at one end. Most
of the visualized protein was in monomeric form (Fig. 4).
Occasional aggregates with the appearance of dimers and trimers
were seen but have not yet been resolved in detail.

Fig. 4. Rotary shadowing of recombinant punctin. a, overview.
Remaining panels, images of individual punctin molecules. Scale bar in
panel a indicates molecular dimensions in all panels.

Expression and Localization of Punctin in Transfected COS-1
Cells — Transfected cells were stained without fixation or
permeabilization and on ice (live staining) to prevent the detection
of intracellular punctin or endocytosed antibody, respectively.
Under these conditions, punctin was localized underneath the
cells (i.e. adjacent to their ventral surface) in the substratum
laid down on plastic. The staining pattern was punctate (Fig. 5,
a-d) and was preferentially located toward the periphery of the
cells (Fig. 5, a, b, and d) and under cellular processes (Fig. 5c).
The punctin deposits were of submicron dimension, although
fluorescent signals from closely located deposits were
frequently merged, suggesting larger aggregates. Transfected cells
had minimal or no staining on the dorsal cell surface. Punctin
was not seen in the substratum in areas not corresponding to
the cells. If cells were detached with 10 mM EDTA prior to
staining, "footprints" of transfected cells were retained on the
substratum with a similar staining pattern as under intact
cells. Staining was seen in some areas not covered with cell
processes. In other areas, there were cell processes without
underlying punctin (Fig. 5c). We interpret this finding to result
from cellular motility (i.e. withdrawal of existing processes and
the formation of new ones). Identical results were obtained
with anti-FLAG monoclonal antibody or antibody 4113. Fig. 5,
a-c, shows staining of FLAG-tagged protein using the FLAG
M2 monoclonal antibody, and Fig. 5d shows staining with
anti-punctin antiserum 4113. Similar staining patterns were
seen whether cells were grown in the presence or absence of
serum and using tagged or untagged proteins (data not shown).

Double staining for vinculin (Fig. 5d) or focal adhesion
kinase (data not shown), components of focal contacts, indicated
that punctin staining did not correspond to sites of focal
contacts. No staining was visible in control experiments, i.e. in
untransfected COS cells, cells transfected with vector alone,
cells stained without a primary antibody, or cells stained with
preimmune serum as control.

On Western blots, we found reactive protein bands of the
expected size (58-60 kDa for untagged punctin and 62-64 kDa
for the His-tagged or FLAG-tagged forms) in the medium, cell
layer, and the underlying substratum or ECM of transfected
COS-1 cells (Fig. 5e). In contrast, cells transfected with vector
alone (Fig. 5e) or untransfected cells (data not shown) did not
show a reactive band. As a control, preimmune serum from the
rabbits in which anti-punctin antibodies were generated did
not produce immunoreactivity on Western blots (data not
shown).

DISCUSSION 

Punctin/ADAMTSL-1 Is a Novel ADAMTS-like Secreted
Protein Belonging to a Distinct ADAMTSL Family of Proteins —
In addition to missing the catalytic domain, the ADAMTS-like
proteins (see below) do not possess disintegrin-like
domains. This finding suggests that the disintegrin-like domain
and catalytic domain may represent a functionally coupled
protease domain in ADAMTS enzymes. Further evidence
for this comes from the identification of other proteins with a
predicted structure similar to punctin. Following the complete
cloning of punctin/ADAMTSL-1, we became aware of a second
such molecule encoded by the KIAA0605 gene (GenBank™
accession number AB011177) that we designated as
ADAMTSL-2 (23). We have cloned a third ADAMTS-like protein,
ADAMTSL-3 (GenBank™ accession number AF237652).³
Therefore, punctin belongs to a distinct protein family.
ADAMTSL-2 and ADAMTSL-3 differ from punctin in their greater
length (951 and 1690 amino acids, respectively) and also have
more TS domains (6 and 10, respectively). These molecules will
be described in greater detail in subsequent publications. In
contrast to ADAMTSL-2 and ADAMTSL-3, which are quite
widely expressed,⁴ punctin/ADAMTSL-1 is selectively
expressed in muscle.

Fig. 5. a-d, confocal laser-scanning microscopy of COS-1 cells
following transient transfection with ADAMTSL1 expression constructs
and immunocytochemistry. Untransfected cells are visible in a and b.
Scale bar (10 µm) is shown at lower right of each panel. a and b,
punctate staining of FLAG-tagged punctin (red) in nonpermeabilized
cells visualized with anti-FLAG M2 antibody. Nuclei are blue
(4',6-diamidino-2-phenylindole). c, relationship of punctin staining
(red) visualized with anti-FLAG M2 monoclonal antibody to cellular
actin as visualized by phalloidin staining (green). The asterisk
indicates a cellular protrusion that does not have underlying punctin,
and the arrow indicates punctin immunolocalization without an overlying
cellular process. d, relationship of punctin (red) visualized with
anti-punctin antiserum 4113 to vinculin staining (green) as shown by
confocal imaging and overlay of single-color images from a
double-stained cell. e, Western blot analysis of cell lysates (lane 1),
medium (lane 2), and ECM (lane 3) from transfected COS-1 cells using an
anti-His tag monoclonal antibody. Cell lysates from untransfected COS-1
cells are shown in lane 4. Molecular mass is indicated on the left.

Other secreted ECM molecules such as lacunin and papilin 
also contain the ancillary domains of the ADAMTS family in 
the precise order as punctin. However, punctin is more closely 
related to ADAMTSL-3 and some ADAMTS proteases than it is 
to mouse papilin (32% identity). Lacunin is a basement membrane
glycoprotein in the moth Manduca sexta (24). Lacunin
has the structure of ADAMTSL including seven TS modules as 



³ N. Moore, B. Anand-Apte, and S. Apte, unpublished data.
⁴ S. Apte, unpublished data.



12188 



ADAMTS-like Protein 



weli as a single COOH-terminal protease and lacunin domain. 
In addition, it contains 13 repeats of a novel lagrin domain. 11 
Kunitz inhibitor domains. 2 antistasin-like domains. 1 serine 
protease inhibitor domain, and 2 immunoglobulin domains. 
Lacunin localizes to the basal lamina of the moth wing (24i. 
Papilin from Drusuphila melanogaster may be an ortholog of M. 
sexta lacunin, because the two molecules are similar in their 
domain content, organization, and primary sequence. Papilin is 
also a basement membrane protein (25). Although these inver- 
tebrate proteins have numerous protease inhibitor domains, 
mammalian papilin contains substantially fewer such domains 
(25). 

Characterization of Recombinant Punctin from Insect Cells—
Our experimental data support the likelihood that recombinant
punctin is disulfide-bonded. First, its electrophoretic mobility
is greater under nonreducing conditions. Second, the punctin
epitope is masked under nonreducing conditions. Third, rotary
shadowing demonstrated a molecule with a specific and consistent
conformation. Limited proteolysis by a putative serine protease
within the linker peptide connecting TS domains 2 and 3, assigned
to the Tyr371-Asp372 peptide bond (Fig. 1b), indicates that there
may be a proteolytically susceptible exposed region between the
two disulfide-bonded TS domains. It is not yet known whether this
is a physiologically relevant processing event or an artifact
unique to this expression system. The processing event releases
the two COOH-terminal TS domains of punctin. Because
proteolytically derived fragments of many secreted proteins have
distinctive functions, it will be interesting to investigate
whether specific functions are associated with the ~40- and
~20-kDa fragments.

A mass measurement of epitope-tagged recombinant punctin by
MALDI-TOF MS and LC-ESMS revealed that purified punctin contained
multiple species of higher than predicted mass. Edman degradation
indicated that all these species had the same amino terminus.
Further MS analysis, glycoprotein staining, and culture in the
presence of tunicamycin A confirm that punctin contains N-linked
sugars but do not exclude the presence of O-linked sugar.
Significant alteration of mobility was not seen after peptide
N-glycosidase F treatment, suggesting that the N-linked
carbohydrate may be resistant to complete enzymatic removal (26).
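
As an illustration only (this is not part of the reported methods), candidate N-linked glycosylation sites are conventionally predicted by scanning a protein sequence for the Asn-X-Ser/Thr sequon, where X is any residue except Pro. The sketch below assumes that simple rule; the function name and the toy sequence are hypothetical and are not the punctin sequence.

```python
import re

def find_n_glyc_sequons(protein):
    """Return 1-based positions of Asn residues in N-X-[ST] sequons (X != Pro)."""
    # A lookahead is used so that overlapping sequons are all reported.
    return [m.start() + 1 for m in re.finditer(r"N(?=[^P][ST])", protein)]

# Toy sequence (hypothetical): NGS and NNT/NTS are sequons, NPT is not.
toy = "MKNGSAANPTLLNNTSQ"
print(find_n_glyc_sequons(toy))
```

A sequon is only a necessary condition for N-glycosylation; whether a given site is actually occupied has to be established experimentally, as with the tunicamycin and glycosidase experiments described above.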

Rotary shadowing is useful for demonstrating the physical
conformation of a molecule as well as the existence of oligomeric
complexes (27-29). The data we have obtained for punctin are
relevant to the ADAMTS proteases, lacunin, and papilin. They can
be extrapolated to represent the structure of the ancillary
domains of an ADAMTS enzyme and the "papilin cassette" (25) and
provide the first insight into the conformation of these domain
assemblies. Many ECM proteins exist as oligomers. This may also
be the case with punctin, because rotary shadowing electron
microscopy and gel electrophoresis occasionally suggested the
presence of dimers and trimers. We anticipate that rotary
shadowing will be useful for future studies to investigate
punctin oligomerization and interactions of punctin with putative
ECM ligands.

Punctin Is an ECM Glycoprotein That Binds to the Cell Substratum
in a Spatially Specific Manner—Nontransformed cells in culture
require a substratum for attachment, spreading, and migration.
The substratum present on an unmodified plastic tissue culture
surface is derived from the cells themselves as well as from
proteins in serum-supplemented culture medium (30-32).
Quantitatively significant components of the cell substratum are
laminin, fibronectin, vitronectin, collagen, tenascin, PG-M or
versican (a chondroitin sulfate proteoglycan), perlecan (a
heparan sulfate proteoglycan), hyaluronan, and tissue inhibitor
of metalloproteases-3 (30-37). Punctin shares the subcellular
distribution of molecules that do not generally co-localize with
focal contacts (e.g. versican, hyaluronan, and tenascin) (31,
37). Because punctin is left behind in the ECM after cell
detachment with EDTA, we conclude that when expressed in COS-1
cells, punctin binds a component of the ECM. Punctin in culture
medium may reflect an excess over the amount that can bind to the
substratum or indicate secretion from the free surface of the
cell. Punctin does not bind to ECM between the cells, indicating
that the punctin ligand is absent from these regions. Because
similar staining was seen under serum-supplemented as well as
under serum-free culture conditions, it is probable that the ECM
binding partner of punctin is a molecule produced by COS-1 cells
and not one derived from fetal bovine serum.

Significance of Punctin and the ADAMTS-like Family—Molecules
comprising ancillary domains of metalloproteases may be generated
in biological systems by proteolytic processing or through
alternative splicing of protease genes. Brooks et al. (38) found
that the proteolytically generated hemopexin domain of MMP-2
circulated in serum and bound to the integrin αvβ3. This MMP-2
fragment inhibited angiogenesis by preventing membrane targeting
of MMP-2 (38). So far, there are no known examples of ADAMTS-like
proteins generated as splice variants of ADAMTS genes. The
discovery of punctin demonstrates for the first time the
existence of molecules closely resembling the ancillary domains
of ADAMTS that are generated as distinct gene products.

The resemblance of ADAMTSL to ADAMTS suggests a functional
relationship between these two groups of molecules. From studies
on ADAMTS-1 (39) and ADAMTS-2 (40), it is known that the
ancillary domains are required to bind and cleave substrates.
ADAMTSL may regulate ADAMTS through one of several possible
mechanisms. As a result of noncompetitive inhibition of ADAMTS-2,
an inhibitory role has been shown for Drosophila papilin (25).
Another possibility is that punctin may compete with ADAMTS for
its substrates and protect the substrates from cleavage. The
isolated MMP-2 hemopexin domain represents one such example. In a
second example, a truncated nonenzymatic version of ADAM-17 was
shown to have a dominant negative effect on the activation of
tumor necrosis factor-α (41). An intriguing possibility is that
the ADAMTS-like proteins may be enhancers of the ADAMTS
proteases. For example, the procollagen C-proteinase enhancer
protein (42) contains two domains homologous to those found in
the C-proteinase that are instrumental in binding to the carboxyl
propeptide of procollagen I and enhancing its removal (43). Very
little is currently known about the regulation of ADAMTS
proteases following their activation, and it is possible that the
ADAMTS-like proteins may provide a novel general principle of
regulation.

Acknowledgments—We thank C. Kassuba for editing the manuscript,
Vincent C. Hascall, Tom Tallant, Seng Hui Low, and Thomas Weimbs
for helpful discussion, members of the Apte laboratory for
critical reading of the manuscript, Satya Yadav for protein
sequencing, Monique Ross for assistance with cloning, Karen West
for amino acid analysis, and Judy Drazba for help with confocal
imaging.

REFERENCES 

1. Murphy, G., and Knauper, V. (1997) Matrix Biol. 15, 511-518
2. Blobel, C. P. (1997) Cell 90, 589-592
3. Kuno, K., Kanada, N., Nakashima, E., Fujiki, F., Ichimura, F., and Matsushima, K. (1997) J. Biol. Chem. 272, 556-562
4. Hurskainen, T. L., Hirohata, S., Seldin, M. F., and Apte, S. S. (1999) J. Biol. Chem. 274, 25555-25563
5. Wallon, U. M., and Overall, C. M. (1997) J. Biol. Chem. 272, 7473-7481
6. Steffensen, B., Bigg, H. F., and Overall, C. M. (1998) J. Biol. Chem. 273, 20622-20628
7. Bein, K., and Simons, M. (2000) J. Biol. Chem. 275, 32167-32173
8. Bornstein, P., Armstrong, L. C., Hankenson, K. D., Kyriakides, T. R., and Yang, Z. (2000) Matrix Biol. 19, 557-568



9. Overall, C. M., King, A. E., Sam, D. K., Ong, A. D., Lau, T. T., Wallon, U. M., DeClerck, Y. A., and Atherstone, J. (1999) J. Biol. Chem. 274, 4421-4429
10. McQuibban, G. A., Gong, J. H., Tam, E. M., McCulloch, C. A., Clark-Lewis, I., and Overall, C. M. (2000) Science 289, 1202-1206
11. Zhang, X. P., Kamata, T., Yokoyama, K., Puzon-McLaughlin, W., and Takada, Y. (1998) J. Biol. Chem. 273, 7345-7350
12. Zolkiewska, A. (1999) Exp. Cell Res. 252, 423-431
13. Kuno, K., and Matsushima, K. (1998) J. Biol. Chem. 273, 13912-13917
14. Georgiadis, K. E., Hirohata, S., Seldin, M. F., and Apte, S. S. (1999) Genomics 62, 312-315
15. Watson, M. L., and Seldin, M. F. (1994) Methods Mol. Genet. 5, 369-387
16. Crabb, J. W., West, K. A., Dodson, W. S., and Hulmes, J. D. (1997) in Current Protocols in Protein Science (Coligan, J. E., Ploegh, H. L., Smith, J. A., and Speicher, D. W., eds) pp. 11.9.1-11.9.42, John Wiley & Sons, Inc., New York
17. Kapron, J. T., Hilliard, G. M., Lakins, J. N., Tenniswood, M. P., West, K. A., Carr, S. A., and Crabb, J. W. (1997) Protein Sci. 6, 2120-2133
18. Crabb, J. W., Nie, Z., Chen, Y., Hulmes, J. D., West, K. A., Kapron, J. T., Ruuska, S. E., Noy, N., and Saari, J. C. (1998) J. Biol. Chem. 273, 20712-20720
19. Sakai, L. Y., and Keene, D. R. (1994) Methods Enzymol. 245, 29-52
20. Kozak, M. (1989) J. Cell Biol. 108, 229-241
21. Gavel, Y., and von Heijne, G. (1990) Protein Eng. 3, 433-442
22. Bornstein, P., and Sage, E. H. (1994) Methods Enzymol. 245, 62-85
23. Nagase, T., Ishikawa, K., Miyajima, N., Tanaka, A., Kotani, H., Nomura, N., and Ohara, O. (1998) DNA Res. 5, 31-39
24. Nardi, J. B., Martos, R., Walden, K. K., Lampe, D. J., and Robertson, H. M. (1999) Insect Biochem. Mol. Biol. 29, 883-897
25. Kramerova, I. A., Kawaguchi, N., Fessler, L. I., Nelson, R. E., Chen, Y., Kramerov, A. A., Kusche-Gullberg, M., Kramer, J. M., Ackley, B. D., Sieron, A. L., Prockop, D. J., and Fessler, J. H. (2000) Development 127, 5475-5485
26. Tretter, V., Altmann, F., and Marz, L. (1991) Eur. J. Biochem. 199, 647-652
27. Holden, P., Meadows, R. S., Chapman, K. L., Grant, M. E., Kadler, K. E., and Briggs, M. D. (2001) J. Biol. Chem. 276, 6046-6055
28. Maddox, B. K., Sakai, L. Y., Keene, D. R., and Glanville, R. W. (1989) J. Biol. Chem. 264, 21381-21385
29. Olin, A. I., Morgelin, M., Sasaki, T., Timpl, R., Heinegard, D., and Aspberg, A. (2001) J. Biol. Chem. 276, 1253-1261
30. Hayman, E. G., Oldberg, A., Martin, G. R., and Ruoslahti, E. (1982) J. Cell Biol. 94, 28-35
31. Hayman, E. G., Pierschbacher, M. D., Suzuki, S., and Ruoslahti, E. (1985) Exp. Cell Res. 160, 245-258
32. Ohji, M., SundarRaj, N., Hassell, J. R., and Thoft, R. A. (1994) Invest. Ophthalmol. Visual Sci. 35, 479-485
33. Snow, A. D., Sekiguchi, R., Nochlin, D., Fraser, P., Kimata, K., Mizutani, A., Arai, M., Schreier, W. A., and Morgan, D. G. (1994) Neuron 12, 219-234
34. Singer, I. I., Scott, S., Kawka, D. W., and Hassell, J. R. (1987) Exp. Cell Res. 173, 558-571
35. Isogai, Z., Shinomura, T., Yamakawa, N., Takeuchi, J., Tsuji, T., Heinegard, D., and Kimata, K. (1996) Cancer Res. 56, 3902-3908
36. Staskus, P. W., Masiarz, F. R., Pallanck, L. J., and Hawkes, S. P. (1991) J. Biol. Chem. 266, 449-454
37. Yamagata, M., Saga, S., Kato, M., Bernfield, M., and Kimata, K. (1993) J. Cell Sci. 106, 55-65
38. Brooks, P. C., Silletti, S., von Schalscha, T. L., Friedlander, M., and Cheresh, D. A. (1998) Cell 92, 391-400
39. Kuno, K., Terashima, Y., and Matsushima, K. (1999) J. Biol. Chem. 274, 18821-18826
40. Colige, A., Sieron, A. L., Li, S. W., Schwarze, U., Petty, E., Wertelecki, W., Wilcox, W., Krakow, D., Cohn, D. H., Reardon, W., Byers, P. H., Lapiere, C. M., Prockop, D. J., and Nusgens, B. V. (1999) Am. J. Hum. Genet. 65, 308-317
41. Solomon, K. A., Pesti, N., Wu, G., and Newton, R. C. (1999) J. Immunol. 163, 4105-4108
42. Takahara, K., Kessler, E., Biniaminov, L., Brusel, M., Eddy, R. L., Jani-Sait, S., Shows, T. B., and Greenspan, D. S. (1994) J. Biol. Chem. 269, 26280-26285
43. Hulmes, D. J., Mould, A. P., and Kessler, E. (1997) Matrix Biol. 16, 41-45



Ape Senai » 09. '54 ibt- Erfurt C 

Novel Human Thrombospondin Repeat Proteins and Polynucleotides Encoding the Same



>AC027807.6.1.181327

Length = 181327

Score = 2117 bits (1068), Expect = 0.0
Identities = 1071/1072 (99%)
Strand = Plus / Minus



Querv T : 2645 aaa tcaaa tca^aca ^gaaaacaaciact; tggtgagcagggt ccgcagatcc tcagtg tec i 

St ] ct : 1 0 42 4 0 aaa tcaaa tcagagat gaagacaaaac: tggtgagcagggt ccgcagatcc tcagtg tec 1 : j4 1 3 

Query : 2 7 05 agagagtctacat tcagacaagggaagagaagcgtattaacc tgaccattggtagcagag 2 7 64 

I M ! M I ! I I ! I I I ! M I M M M ! M i M I i i I ! ! M M M ; i i ; : I ! ! i ! : i M n : \ 

SI j cv : lO-tl.80 agagagtctacat tcagacaagggaagagaagcgtattaacc tgaccattggtagcagag -t^l 
r-uer.' : 2 7 6 5 cctat t tgc tgcccaacacatccgtgat tat taagtgce cc gegegae gat tec agaaa t 2-24 

I M I M I I I I M I M i M I I M I I I M M I I I II ! M I M I M I I I M I M II I I M M : 

St] c*. : 104120 cctatttgctgcccaacacatccgtgattattaagtgccccgtgcgacgattccagaaat l-:.'40o 
r-uprv : 2 82 5 etc tgat ccag tgggagaaggatggceg t tgcctgcagaac tccaaacggct tggcatca 2o34 

I I MM I I ! I ! I I I MM I I I ! I I I ! I !! I I I I I I I ! I I M I I M I I II ! I II M I I I I I 

Sb j ct. : 1040 6 0 ctctgatccag tgggagaaggatggceg t tgcctgcagaac tccaaacggct tggcatca 1.04 0 0 
<tu^rv: 2 33 5 ccaagtcaggc tcac taaaaatccacggtc t tgc tgcccccgaca teggegtg taceggt 2 94 4 

1 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i i i 1 1 i i i I i Mil 

Sb jet : 104000 ccaagtcaggc tcac taaaaatccatggtct tgc tgcccccgaca teggegtg taceggt ^J394 
pupiv: 2 94 5 gcat tgeagge tctgcacaggaaacagt tgt get eaagc teat tgg tact gacaaccggc 3 0 04 

I I ! I I I I I I I I I I I I ! I I I I I I I I I i I ! I I I I I I I I i I I I M I I I I I I I I I I M I M I I ! 

Sbi c~. : 10 5 940 acat tacaggc tctgcacaggaaacagt tgt get caagc teat tgg tact gacaaccggc ^J3 3 3 



■ eer/ : 3 0 05 teat cgcacgcecagccctcagggagcc tatgagggaata tec tgggatggaceacagcg j<t'o< t 

i i I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I i I I I I I I I I I ! I I I I I I I I I I 

St jet. : 10 3 880 teat cgcacgcecagccctcagggagcc tatgagggaata tec tgggatggaceacagcg ^J332 
■"■u^rv : 3 0 65 aagecaatag t t tgggagtcacatggcacaaaatgaggcaaatgtggaataacaaaaacg J 12 4 

M I M M M M I M I I I II I I I M I I II M M M M M M I M II M M I M M M M M 

Sbjct : 10 382 0 aagccaatagtttgggagtcacatggcacaaaatgaggcaaatgtggaataacaaaaatg _u 37 b 



'■u^rv • 312 5 acctttatetggatgatgaccaeattagtaaccagcctttcttgagagetctgttaggec 3 18 4 

I I I ! I ! I I I I I I ! I I I I I I I I I I ! i I ! I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

rbj ct : 10 3760 acctttatctggatgatgaccacattagtaaccagcctttcttgagagetetgttaggcc j / 0 
eery : 3135 actgeagcaat tc tgcaggaagcaccaae tectg ggagt tgaagaataageagt t tgaag 3244 

M I I I I I I I I I I I i I I II M I I IS I I II M M I ! I I I M I I I II II M I II I I I M M M 

rt'H-ct: 10 3700 actgeagcaat tctgcaggaagcaccaae tcctgggagt tgaagaataag tagtt tgaag ^036^ 



;ery: 3245 cagcagttaaacaaggagcatatageacggatacagcccag^ ttgatgagctgatciagaa 3304 
: 113 64 0 ca^ragttaaacaaggagratatagcatggatacageccaatttgatgagrtgataagaa i J3 r>8 . 



:uerv : 3..i 0 5 



:a:gag:cacc:catggaaa :cggagagg:cagc:a:ga:c::gcg:cc:ac::ga:a: j : 



5 5 30 aca:gagtcagc: ca tggaaa z rggagagg tcagcgat gat ct tg eg tec :agc:ga tar 



r.ery : 3 3 65 atcagc tggtggccgaat tagccaaggcacagccaacacacatgeagtggcggggca tec j - 



1 ! I I 



! ! ! 



I I m I ! I i ! ! 



103 52 0 atcagetggtggccgaattagceaaggcacagccaacacacatgcagtggcggggcatc:: 10346: 



aggaagagacacc tc 2 tgctg>; 



: 1 0 3 4 6 0 



foa a cTaGa c a ^ 



gc tcagaggggaaacagggagtgtg t :::aaag:: j - 
gc t cagagggg aaacagggag T t g t g t c c 2 aaagc t ~ 



01 



iVerv : 3435 cgea tgcaaaaaac t caggcaagc t gacat tcaagccgaaaggacc tgt tctcatgaggc 3 5 44 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i 1 1 1 m 1 1 1 1 1 m 1 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 ! 11 1 1 1 1 1 1 1 

:: b j : 1034 0 0 coca tgcaaaaaac t caggcaagc t gacat tcaagccgaaaggacc tgt tctcatgaggc 1 u jo 41 

0-uerv : 3 34 5 aaagccaacc t ccc t caa: t teat t taataaaacaa taaa t tccaggat tggaaa tacag 3 6 04 

; I I I I j j I i I I I j I I [ I j ! ; M ; \ \ \ \ \ \ \ \\ \ \ \ \ \ \\ \\ \ \ \ \\ \ \ \ \\ \ \ \ \ \ \ \ 

£bj -t : 103340 aaagccaacetccctcaat ttcat t taataaaacaataaat tccaggat tggaaatacag ^-j.:^3- 

^uery : 3^05 tatacat tacaaaaaggacagaggtcatcaatatactgtgtgacct tat tacccccagtg 3 664 

I M I I I I I I M I I M I I I II I i I i I I i I i I M I I I M I I I i I I i i I I i i M I I I i i I M i 

Sbg ct : 1032 3 0 ta tacat tacaaaaaggacagagg tea tcaatatac tgtgtgacct tat tacccccagtg 1 0 j. 22 1 
r, u ^K-y - 3hh r , aagecaca ta taca taaaccaaaaa taaaacc t tat tacagccc tcagtaaa 3 7 16 

II I I ! I I I I I I I I I I III I I INN Mill I I I I I I I I I I I I I I I I Ml ! I I 

Sb jet. : 1032 2 0 aggccacatatacatggaccaaggatggaacct tgt tacagccc tcagtaaa 10 316 9 



Score = 547 bits (276), Expect = e-152
Identities = 285/288 (98%)
Strand = Plus / Minus



Muery : 1701 gt tea t tccagaaccc tggtcagee tgcagtaccacg tgtgggccaggtg tgcagg tccg 1 / 60 

I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I 

::bi - t : 17542 1 gt tea t tccagaaccc tggtcagee tgeagtaceaegtgtgggccgggtgtgcaggtecg 1 / ^ j-o2 



6 1 



■O] Ct : i/Jj'). 



egaggtgaagtgecgtgtgctcct raeat tcaegcagac tgagactgage tgccegagga 1 8 2 i 

i i I it M M I I I i I M I M I I I M I I I I I i I i I I M i s I II M M I I I I I I I I M , I I \ 

tgaggtgaagtgecgtgtgctccteacat tcaegcagac tgagactgage tgccegagga 173 ; 



02 



■>jery : 1321 agagtgtgaaggccecaagc tgeccaccgaacggccctgcctcc tggaagca tgtgatga 18 80 

I I I II I I I I I I I I I ! I I I I I I I I M I I M M M I I ! I I I ! I I I M I ! i I i I I ! I I I M ! I 

Sbjct: 173301 agagtgtgaaggccccaagctgcccaccgaacggccctgcctcctggaagcatgtgatga i73242 



yjery: 1881 ?5gc:ca3c::-:ccgacacc:aga:a:c:::::ccc:caGcacac:cacacgGc::a:ga _ 
3b^ct: 1~3241 :agc:c3gc:tccccagagct:aga:a:::c::::cc:gacga:aG:gagacgac:ta:ga 



.941 ::gggag:acgc:Gggc:caccc::tg:a-:agcaaca:gc:tgggagg 1988 
."3181 c tag gagtacgc tgggt tcacccc: tgca :;agcaacatgcgtgggagg 17 3134 



Score = 432 bits (218), Expect = e-118
Identities = 218/218 (100%)
Strand = Plus / Minus

juery : 47 5 3 agacc :acct taagaaggaac tgcacatcaggggcc tgtgatgtgtgt tggeacacag gc 4812 
3b;ct : 49742 agacccacet taagaaggaac tgcacatcaggggcc tgtgatgtgtgt tgg :acacag gc 49683 

■"luerv : 4813 cc ttggaagccctgtacagcagcctg tggcaggggt ttccagtc tcggaaagtcgactgt 4b 7 2 

i ! I I ! I M I I I ! I I I I I I I ! I M M I M M I M I I I I I I I M I I I I I I ! I II I ! I I ! ! I 

5b j ct : 4 9682 cc ttggaagccctgtacagcagcctg tggcaggggt ttccagtc tcggaaagtcgactgt 4 962 3 
<>aery: 487 3 a t ccacacaaggagt tgcaaacc tgtggccaagagacac tgtgtacagaaaaagaaacca 4932 

M I I 1 : 1 1 1 I 1 1 1 I M I II i I ! I I 1 1 i i I ! i i 1 1 1 1 I i I i I i ! i I i I I 1 1 I I I I i 1 i i i I 

Sbjct: 49622 atccacacaaggagttgcaaacc tgtggccaagagacac tgtgtacagaaaaagaaacca 49563 
guery: 4933 a t t tec tggcggcactgtc t tgggccctcctgtgatag 4970 

Mill I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

Sbjct: 49562 at t tec tggcggcactgtct tgggccctcc tgtgatag 49525 
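
The coordinates printed beside each alignment can be checked against its summary line. A minimal sketch, assuming inclusive 1-based coordinates as BLAST reports them (the helper names are illustrative): the query range 4753 to 4970 and the subject range 49742 to 49525 of the alignment above both span 218 bases, and the descending subject coordinates are what the "Plus / Minus" strand line reflects.

```python
def span(start, end):
    """Number of bases covered by an inclusive coordinate range."""
    return abs(end - start) + 1

def strand(start, end):
    """Report 'Plus' for ascending coordinates, 'Minus' for descending ones."""
    return "Plus" if end >= start else "Minus"

# Coordinates taken from the alignment above.
query = (4753, 4970)
sbjct = (49742, 49525)

print(span(*query), span(*sbjct))                # both spans should agree for an ungapped alignment
print(strand(*query) + " / " + strand(*sbjct))   # strand orientation of query / subject
```

When the two spans differ, the alignment either contains gaps or the printed coordinates have been corrupted in transcription.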



Score = 398 bits (201), Expect = e-107
Identities = 201/201 (100%)
Strand = Plus / Minus



nuery : 4456 aggtggt tcacaagtgtgtggtcacagtgctc tgtgtc t tgcggtgaaggataccacagt 4 5 i 5* 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I i I i I I M 

Sbjct. : 612 77 aggtggt tcacaagtgtgtggtcacagtgctc tgtgtc t tgcggtgaaggataccacagt 612 1£ 

guery: 4516 cggcaggtgacgtgcaagcggacaaaagccaa tggaac tgtgcaggtggtgtc tccaaga 4575 

! I I I I I I I I I I I I I I I I I I : I I : I I I M : I : I i M I I I I : I I I M I I I I! M I ! I I i i I i 

Sbjct: 61217 cggcaggtgacgtgcaagcggacaaaagccaatggaactgtgcaggtggtgtctccaaga 6115E 

guery: 4576 gcatgtgcccc taaagaccggcc tc t gggaagaaaaccatgt t t tggtcatccatgtgt t 4635 

■ I ^ : ' ' : 

Sbjct: 61157 gcatgtgcccctaaagaccggcctctgggaagaaaaccatgttttggtcatccatgtgtt 6109E 



Query: 463 6 cagtgggaaccagggaaccgg 465 6 

1 h ! M M ! i ! ! M ! I M ; 
Sbjct: 6 1097 cagtgggaaccagggaaccgg 6107" 



Score = 387 bits (195), Expect = e-104 
Identities = 195/195 (100%)
Strand = Plus / Minus 



1116 aggtggcatgtgggctcttgggggccctgct cage tacc tgt ggagttggaattcagacc 

1 I ! I ! I ! I I ! I I I I ! ! i I I i I : i I ! M i i M M i ! i :l ! ! i M I M ! ! i I I ! i ; M i M : 



St) J Ct : 



143 919 aggtggcatgtgggctcttgggggccct get cage tacctgtggagt tggaat tcagacc 14 3 86 0 



<~'uerv : 2 17 6 cgagatgtgtactgcc tgcacccaggggagacccc tgecee tec tgaggagtgecgagat 22 3 5 

! I I I I I I I i I i I I I I I i I I I I I ! i I I I I i I ! I i I ! ! i i i I i i i ! i I i I I i i ! i ! I ! ! i i : 

Sbjct : 1438 5 9 cgagatgtgtactgcc tgcacccaggggagacccc tgcccc tec tgaggagtgecgagat 143800 



Query: 
Sbiet : 



2 23 6 gaaaagccccatgctttaeaagcatgcaatcagt ttgactgccc tcctggctggcacat t 22 95 

I I I I I I I I I I I I I I I I I I I I ! I I I I II II I I I I I I ! I I I I I I I I I I I t I I I I I I I I ! I I 

143 7 99 gaaaagccceatgctttacaagcatgeaatcagtttgactgccctcctggctggcacatt 14:4740 



Querv: 2296 gaagaatggcagcag 2310 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 

Sbjct: 143739 gaagaatggcagcag 143725 



Score = 373 bits (188), Expect = e-100
Identities = 188/188 (100%)
Strand = Plus / Minus 



r-uery: 42 7 2 agagee tt t t tgggagcctggtaac tggtcacat tgt tc tgccacc tgtggtcat t tggg 43 31 

IMIIMIIIIIIMIIIIIIIIIIIMIIIIIIIIIMMIMIIIIIIIIIIIIIIM 

Sbj ct : 6 5105 agagee tt 1 1 tgggagcctggtaac t ggt cacat tgt tc tgccacc tgtggtcat t tggg b5046 



Ouerv : 4 332 agcccgcat tcagagaccccagtgtgtgatggccaatgggcaggaagtgagtgaggccct 43 91 

i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : : : 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 

Sbjct : 6 5 045 agcccgcat tcagagaccccagtg tgtgatggccaat gggcaggaagt gag tgaggcect 64986 



luery : 4 3 92 gtgtgatcacc tccagaagccactggctgggt ttgagccctgtaacatccgggactgccc 44 51 

I 1 1 I 1 1 ! I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 ! I 1 1 I I I I I I 1 1 I 1 1 I II 1 1 I i 

?bjct : 64985 gtgtgatcacctccagaagccactggctgggtttgagccctgtaacatccgggactgccc 64926 



Query: 4 4 52 agegaggt 44 5 9 

1 1 1 1 1 1 1 1 

Sbjct: 64925 agegaggt 64918 



Score = 371 bits (187), Expect = 3e-99
Identities = 187/187 (100%) 



3 97 0 aaaggtgtccctcagcctaatataacttggttgaagagaggaggatctctgagtggcaat 4 0 2 9 
"1975 aaaggtg t ccc t cage ct aa t ataact tggtt gaagagaggagga t etc t gag tggcaat 71916 



Query : 4 03 0 gtt tcct tgc t t t tcaatgga t ccc tgt tgt tgcagaa tgt t 1 ccc t tgaaaatgaagga 4 0 3 9 
St jet: "1915 gtt tcct tgcttttcaatggatccctgttgttgcagaatgtttecct tgaaaatgaagga 7 135 6 

Muerv : 4090 acc tacgtc tgcatagccaccaatgc tc t tggaaaggcagtggcaacatctgtac tccac 4 14 9 

: ! ! ! I M I I ! i ! I I I ! I I I I ! M M ; ! ! I M : M ! i M ! ! ! ! M i ! M ! M - ! - ; ■ i : 

Sbjct : ^1855 acctacgtctgcatagccaccaargctct tggaaaggcagtggeaacate tgtac tccac / /96 

nuerv: 4150 ttgctgg 4156 

! I I I I I ! 

Sbjct: '.'17 9 5 ttgctgg 7 178 9 



Score = 361 bits (182), Expect = 3e-96 
Identities = 182/182 (100%)
Strand = Plus / Minus 

(~>uerv : 2 3 09 agtgttccaggact tgtggcgggggaac tcagaacagaagagtcacctgtcggcagctgc 2 3 68 

i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i i i I i 1 1 1 I i i 1 1 i 1 1 i 1 1 1 1 I i 1 1 1 1 1 1 1 i 1 1 1 1 1 i 1 1 1 

Sbj ct : 143 612 agtgttccaggact tgtggcgggggaac tcagaacagaagagtcacctgtcggcagc tgc 14 jo 5 3 



"'u^ry : 2369 taacggatggcagc tt t ttgaatctc tcagatgaat tgtgccaaggacccaaggcatcgt 24 2 8 

I I I ! I I I I I I I I I I I I I I I i I I I I I ! I I I I I ! ! I I I I I ! I ! I I I I ! ! I i i I ! I ! I I ! I I i 

Ihj ct : 143 55 2 taacggatggcagc ttt ttgaatctc tcagatgaat tgtgccaaggacccaaggcatcgt 14349 j> 



nuerv : 2 42 9 ctcacaagtcc tgtgccaggacagac tg tcct ccacatttagc tgt gggagactggtcga 248 8 

I 1 1 • I I I I I I I I I I I I I ! I I I I I I I I 1 1 i i n II! I I ! I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Sbjct: 143492 ctcacaagtcc tgtgccaggacagac tgtcc tccacatt tagc tgtgggagactggtcga ^43^jo 

Query: 2439 ag 2490 

! 

Sbjct: 143432 ag 143431 



Score = 301 bits (152), Expect = 2e-78
Identities = 155/156 (99%)
Strand = Plus / Minus



2489 agtgttctgtcagttgtggtgttg^aatccagagcicigaaagcaggtgtgt laa^cgctgg ^z^~ 

1 1 60 3 1 agtgttc tgtcagt tgtggtgt tgiiaatccagagaagaaagcaggtgtctcaaa ^gcz? :: _ . 9 

2 54 9 cagccaaaggtcggcgcatccccctcagtgagatgatgtgcagggatctaccagggttc r 2 608 
115971 cagccaaaggtcggcgcatccccctcagtgagatgatgtgcagggatc taccagggctcc 1159: 



juery: 2609 ctct tgtaagatc t tgccagatgcctgagtgcagta 2644 
,bjct: 115911 ctct tgtaagatct tgccagatgcctgagtgcagta 115876 



Score = 266 bits (134), Expect = 1e-67
Identities = 134/134 (100%)
Strand = Plus / Minus

iuery : 1 9 8 6 aggccatcaagaagccatagcagtgtgc t tacatatccagacccagcagacagtcaatga 2 04 5 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! ! i I I I i I I I I i ! I I I I I ! \ \ \ I I ; i I i 1 1 : 

.bjct : 161611 aggccatcaagaagccatagcagtgtgcttacatatccagacccagcagacagtcaatga 16255 
iuery : 2 04 6 cage ttgtgtga tatggtccaccgtcc tccagccatgagccaggcctgtaacacagagcc 2105 

III I M i I I I I i I i I i M I i I I i i i i I M i : M M i i I i i i I I I I I I I I i i i i I M ! ii 

■ bj :t : 1 61 551 cage ttgtgtga tat ggtccaccgtcc tccagccatgagccaggcctgtaacacagagcc 1624 9 

juery: 2 I 1 '"-' 6 ctgtccccccaggt 2119 

Mi M M M M M 



.bjct: 161491 ctgtccccccaggt 162478 



Score = 264 bits (133), Expect = 5e-67
Identities = 133/133 (100%)
Strand = Plus / Minus 

iuery : 3 84 3 agaggcacctgtcatcttgtctgttgaaagaaatatcaccaaaccagagcacaaceatct 3 9 02 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I 

bj ct : 9 541 9 agaggcacctgtcatcttgtctgttgaaagaaatatcaccaaaccagagcacaaccatct 9 53/0 
uery : 3 9 1-3 y tctg t tgtggttggaggcatcgtggaggcagcccttggagcaaacgtgacaatccgatg 3 9 62 

I I I I I I I I I I I I I I I I I I I I ! I I I I i I I i i I I I I I I I ! ! ! I I I II i I I I I I I i ! I I I i M 

bjct: 95369 gtctgttgtggttggaggcatcgtggaggcagccct tggagcaaacgtgacaa tccgatg 95310 



juery: 3963 tec tgtaaaaggt 3975 

, i ! ! ! i ! i ! 1 i ' 

;b;cr. : 95309 tec tgtaaaaggt 9529' 



Score = 254 bits (128), Expect = 5e-64
Identities = 128/128 (100%)
Strand = Plus / Minus



171" aataattt tggatggaactgggaaga taeagatacagaatcctacaaggaaagaacaagg j. , o 

: M ! M i ! I M I II I i I I I I I i I I i : I : I i i I \ \ I I M I I M I I I I I ; i i : i I I I i I I i = 

97812 aataattttggatggaactgggaagatacagatacagaatcctacaaggaaagaacaagg 97 ;63 



577" catatatgaatgttctgtagctaatcatcttggttcagatgtggaaagttcttctgtgct 3 8 j o 
977 6 2 catatatgaatgt tc tgtagctaatcatct tggt tcaga tgtggaaagt tcttctgtgct 97703 



3837 gtatgcag 3844 

I I I I I I I I 
97702 gtatgcag 97695 



Score = 234 bits (118), Expect = 4e-58
Identities = 118/118 (100%)
Strand = Plus / Minus 

"iuery : 4156 gaacgaagatggccagagagtagaatcgtat ttctgcaaggacataaaaagtacat tc tc 4215 

||l!!lllljll!lllll!IMIII!IM!llll!IIIIIMMI!IIIEIIII!l!!!l 

3b j cr. : 7 0131 gaacgaagatggccagagagtagaatcgtatttctgcaaggacataaaaagtacattctc 7 0 072 



16 caggcaaccaacactagaaccaacagcaacgacccaacaggagaacccccgcctcaag 4 27 3 



1 1 I 



, , I i ; I I I I I I ! I I I i I I i I I I I I I ! I I I I I I I I I I I I I ! I I I I I I I I I : I \ I I 

3b j cr. : 7 007" 1 caggcaaccaacactagaaccaacagcaatgacccaacaggagaacccccgcctcaag / 0014 



Score = 212 bits (107), Expect = 2e-51
Identities = 107/107 (100%)
Strand = Plus / Minus

"-ue-/ - 4 c '7n oagactgcacagacacaactcactactgtatgt ttgtaaaacatct taatttgtgttctc 5029 

: i I I I i i I i I i I I I I I I I I I I I I I : I I i i I M I I I II II ! i i I I I I i I I i M i I M i I i I 

Sb: :t : 48813 gagactgcacagacacaac tcac tactgtatgt ttgtaaaacatct taat t tgtgttctc 48/54 
"iuery : 503 1 1 tagacege tacaaacaaaggtgctgccagtcatgtcaagagggataa 5 07 6 

I I I I I I I I I I I I I ! I ! ! II I I I I M I I ! ! ! I I ! ! I I I I I I I ! I I I ! 

Sbict : 48753 taaaccac tacaaacaaaggtgctgccagtcatgt caagaggga taa 48707 



Score = 196 bits (99), Expect 
Identities = 99/99 (100%)
Strand = Plus / Minus



4656 gtotcctggacgt tgcatgggccgtgccgtgaggatgcagcagcgtcacacagcctgtca 4 - Id 
5 5 17 9 gtatcctggacgt tgcatggg tcgtgctgtgaggatgcagcagcgtcacacagcttgtca d z ^i , 



,ery: 4716 acacaacagctc tgac tccaac tgtgatgacagaaagag 4754 

! ; i i J j j ! I I I M M I I I M M ; ! j j ; j | I ; 1 : i J | ■ j : 
;ct: 55119 acacaacagctc tgac tccaac tgtgatgacagaaagag 55031 
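
The summary lines that head each alignment in this listing can be sanity-checked arithmetically. A minimal sketch, assuming ungapped nucleotide alignments scored +1 per match and -3 per mismatch (an assumed scoring scheme that the listing itself does not state; the function names are illustrative): the raw score in parentheses should equal the matches minus three times the mismatches, and the printed percentage is the identity fraction truncated to an integer.

```python
def ungapped_raw_score(identities, length, reward=1, penalty=-3):
    """Raw score of an ungapped alignment under an assumed +1/-3 scheme."""
    mismatches = length - identities
    return identities * reward + mismatches * penalty

def percent_identity(identities, length):
    """Identity percentage truncated to an integer, as printed in the listing."""
    return (100 * identities) // length

# Identity counts as they appear in the summary lines of this listing.
for identities, length in [(1071, 1072), (285, 288), (155, 156), (218, 218)]:
    print(identities, length,
          ungapped_raw_score(identities, length),
          percent_identity(identities, length))
```

Under these assumptions the counts 1071/1072, 285/288, and 155/156 give raw scores of 1068, 276, and 152, which agree with the parenthesized values in the corresponding "Score" lines.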



>AC012654.5.1.164879

Length = 164879

Score = 2117 bits (1068), Expect = 0.0
Identities = 1071/1072 (99%)
Strand = Plus / Minus

■uery : 1645 aaa tcaaatcagagatgaagacaaaac t tgg tgagcagggtccgcaga t cc t cag tgtcc 27 04 



i 



I 



Mill! i 



tncr: jo 



32 9 aaatcaaatcagagatgaagacaaaacttggtgagcagggtccgcagatcct cag tgtcc 15 770 



lu^rv: 2705 agagagtc tacat tcagacaagggaagagaagcgtat taacctgaccat tggtagcagag 2 , 64 

i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 ii 1 1 1 1 

b jcT : 537 6 9 agagagtc tacat tcagacaagggaagagaagcgtat taacctgaccat tggtagcagag j. jJ710 

■u^rv : 17 65 cctat ttgctgcccaacacatccgtgat tat taagtgccccgtgcgacgat tccagaaat 2 82 4 

! ! ! ! I ! ! ! ! I! I ! ! ! ! ! ! ! I i ! I ! ! I ! ! ! II ! I II I ! I ! ! I ! ! ! ! i ! I M ! I ! I I I M ! i 

,r get : 33709 cctat ttgctgcccaacacatccgtgat tat taagtgccccgtgcgacgat tccagaaat 33 650 

oerv: 2 82 5 c tc tgatccagtgggagaagga tggccg t tgee tgcagaactccaaacggc t tggcatca 2 884 

I I I I I I I I I I I I I I I I I I I I I I I I i I ! ! ! I I I I I i I I I I I I i I I I I ! I I I I i I I I I I I ! I 

bjet : 3 j 649 c tc tgatccagtgggagaagga tggccg t tgee tgcagaactccaaacggct tggcatca jo 9 0 

iuerv : 2885 ccaagtcaggctcactaaaaatccacggtc t tgc tgcccccgacatcggcgtg taceggt 2944 

I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! ! I I I I I I I I I I I I I ! I I I I I I I I I I I I 

bjet : 33 58 9 ccaag t caggc tcactaaaaatcca tgg tct tgc tgcccccgacatcggcgtg taceggt j j o j u 

iuery : 2 94 5 gcat tgeaggetc tgcacaggaaacagt tg tgc tcaagc teat tgg tact gacaaccggc 3 Cm) 4 

■ bjet: 33529 gcat tgeaggetc tgcacaggaaacagr tgtgc t caagctcattggtac tgacaaccggc 3j-47u 

juerv : 3005 tcatcgcacgcccagccctcagggagcc tatgagggaatatcctgggatggaccacagcg 3064 

J j j : J j j j | | | | 1 1 | | 1 ! J | ! J | j J J J j J | M ; i J j ; | ! J J j | j | j J M ! j j ] j I I ; I | \ 

^bjet : 3 3 469 tcatcgcacgcccagccctcagggagcctatgagggaatatcctgggatggaecacagcg 33410 



Query: 3065 aagccaataccttGggagtcacaiggcaCaaaaCGaggcaadCgtggaataacaaaaGtg j^2- 
Sbjct: 3 j- 4 0 9 aagccaatagtttgggagtcacatggcacaaaatgaggcaaatgtggaataacaaaaatg 33ju 

Query : 3 12 5 acct::a:ctggatga:gaccaca::ag:aaccagcc:::c::gagagc:c:c:caggc z 318- 



:ct:tatctggacgatgaccaca::ag:aa-:cagcc::t:c:t:gagagc:c:g: 



:aagc: jj^ 



Query : 3 It 5 ac tgcagcaattctgcaggaagcaccaac tec tgggagt tgaagaa taagcagt ttgaag j 
Sb;ev : 3 3 2 8 9 a r tgcagcaattctgcaggaagcaccaac tee tgggagt tgaagaataagcagt ttgaag 3 j.^3 u 



r.uprv : 314 5 cagcagt taaacaaggagcata tagcatgga tacagcccctgt 1 1 ga tgagc tga ta^gaa j 

1 1 1 ! j 11 1 j I I I i | ! i ! I i ! I I i ! ! ! ! ! ! I M M M ; I I I i ! : i I M 1 1 



3 312 9 cagcagt taaacaaggagcata tagcatgga tacagcccagtt tga tgagc tga taaga a 3 '.: 17 0 



nuer,' : 3 3 ','5 aca tgagt cage tea tggaaaccggagaggt cagegatgatet tgcgtcccagc tga ta t 3 3 64 

I ! ! M I I i I I I I ! I 1 I I I M ! I I I ! ! ! I ! ! 1 I ! I ! I I ! M i M I I I I M ! ! M ! ! I i i ! 

Sbjc-. : 3:.-. 1 6 9 aeatgagtcagctcatggaaaccggagaggtcagcgatgatcttgcgtcccagctgatat 3.: 110 
<~'uery : 3 3 * : . 5 atcagctggtggccgaattagccaaggcacagccaacacacatgcagtggcggggcatce 3414 

I I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I M I II I I I I I I I I I I I I I i i 

Sbjct : 3 3 109 atcagctggtggccgaattagccaaggcacagccaacacacatgcagtggcggggcatcc 3.:-u50 
<~<uerv : 3 415 aggaagagacacc tec tgetget cage tcagaggggaaacagggagtgtg tec caaage t 3 4 84 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 i I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 i 

Sbjct : 33049 aggaagagacacc tcctgctgctcagctcagaggggaaacagggagtgtg tcccaaagc t 32990 
iiuerv : 3435 cgcatgcaaaaaactcaggcaagc tgacat tcaagccgaaaggacctgt tctcatgagge 3 544 

I I I I I I I i I I I I I I I I I I I I I I I i ! I M I I I ; i i i I ! I I I i i I ! I i M i i i : \ ! I ! I i i \ 

Sbjct : 3 2^89 cgcatgcaaaaaac tcaggcaagc tgacat t caagccgaaaggacc tgt tc tea tgaggc j29 j0 
uuerv 3 5 4 5 aaagccaacc tccc tcaatttcat ttaataaaacaataaat tccaggat tggaaatacag 3 604 

l!llll!lllll!IM!IIIIIIIIMIIIIIIIIIIIIIIIIMIMIIII!MII!ll 

Sbjct : 32929 aaagccaacc tccc tcaatt teat t taataaaacaataaat tccaggat tggaaatacag 328 70 
■~>uerv : 3 6 05 tatacat tacaaaaaggacagaggtcatcaatatactgtgtgacc ttat tacccccagtg 3 6 64 

I I I I I I I i I I I I I I I I I II I I I I I I I i ! I I I I I I I I I I I I ! I I I I I M I I I I I I I I I I I ! 

Sbjct : 3 2 869 tatacat tacaaaaaggacagaggtca tcaa tat actgtgtgacc ttat tacccccagtg 32810 
Ouery : 3 6 65 aggccacatatacatggaccaagga tggaacc ttgt tacagccc tcagtaaa 3716 

I ! I I I i I I I i I i I I I I I I ! ! i i ! i ! I I I i ! i I I I I ! I i i I I I i ! I i i I 1 1 I I 

Sbjct: 32809 aggccacatatacatggaccaaggatggdaccttgttacagccctcagtaaa 32/58 



Score = 547 bits (276), Expect = e-152 
Identities = 285/288 (98%) 
Strand = Plus / Minus 



ruery : 17 01 gt teat tccagaaccctggtcagcc tgeagtaccacgtgtgggccaggtgtgcaggtccg j. / -^J 

i i i II i i i II i i I M I i i I ! i i M i i I i M M i I i i i ! i i ! M I I I i ! M ! ! i 

.-; : 10 3 010 gttcattccagaaccctggtcagcctgeagtaccacgtgtgggccgggtgtgcaggtccg 102 951 

nuery : cgaggtgaag tgccgtgtgc tec tcacatt cacgcagac tgagac t gage tgcccgagga 132 1 

Sbjct : lul 9 50 tgaggtgaag tgccgtgtgc tec tcacatt cacgcagac tgagac t gage tgcccgagga lOL.c.91 

r^uer/ : 132 2 agagtgtgaaggccccaagc tgcccacegaacggcce tgee t cc tggaagca tgt gatga 1830 

I I I I I I I I I I i I I I I I I I I I I ! I I I I I ! . I ! I I I I ! I I I I i I I i i I I I i I I i I I I ! ! I M 

St' j ct : 102 8 90 agagtgtgaaggccccaagc tgcccacegaacggcce tgee tec tggaagca tgt gatga 10 2 o3 ± 
<~'uerv : 188 1 gagcccggcc teccgagagctagacatece tctccc tgaggacagtgagacgact tacga 1040 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 i l i 1 1 i i 1 1 1 i 1 1 1 i 

Sbjct : 102830 gagcccggcctcccgagagctagacatccctc tccctgaggacagtgagacgact tacga loi , 71 
r.uer - : 1 ,; '4 : c tgggagtacgetgggt tcacccc t tgeaeagcaacatgc t tgggagg 198 3 

I I I I I I I I M I ! I I I I I I I I I ! I I I I I I i I I I I I I I ! I M MINI! 

Sbjct : 102770 ctgggagtacgctgggttcaccccttgcacagcaacatgcgtgggagg 102723 



Score = 408 bits (206), Expect = e-110 
Identities = 206/206 (100%) 
Strand = Plus / Minus 



r-u^rv : 12 62 gc tgggaacataatcct tggac tgea tgt tccgtgtcc tgtggaggagggat tcagagac 13 2 1 

MIMIIIIIIIIMIIIIIIIIIIMI'IIIIIIIIIIIMIIMMMIIIMIIMI 

Sbjct : 123415 gctgggaacataatcct tggactgcatgttccgtgtcc tgtggaggagggat tcagagac 1 J _-.5b 



i~'u^rv : 1322 ggagctttgtgtgtgtagaggaatccatgcatggagagatattgcaggtggaagaatgga 138 1 

IIIIIIIIIIIIIMIIMIIIIIIIILIIIIIIIIIIIMIIIIIIIIMIIMIIII 

Sbjct : 12 3 355 ggagctttgtgtgtgtagaggaatccatgcatggagagatattgcaggtggaagaatgga 12 .^9t> 



<>u^rv : 13 8 2 agtgcatgtacgcacccaaacccaaggttatgcaaacttgtaatctgtttgattgcccca 144 1 

I I I I II! I I ! I I I I M I I I M I I I ! I I I ; I I II I ! I I I I I I I I I I I I I I I I I I I I I I I M 

Sb] ct : 12 3 2 95 agtgcatgtacgcacccaaacccaaggttatgcaaacttgtaatctgtttgattgcccca 12 j23 6 



Query: 1442 agtggattgccatggagtggtctcag 1467 

||||||||||||||||||||||||||

Sbjct: 123235 agtggattgccatggagtggtctcag 123210 



Score = 387 bits (195), Expect = e-104 
Identities = 195/195 (100%) 
Strand = Plus / Minus 



aggtggcatgtgggctcctggggg^ D 
Sbj rt : "3508 aggtqgcatgtgggctct tgggggccctcctcagetacctgtggagt tggaat tcagacc . 34^ 

nuprv : 2 17 6 cgaaa:g:gt:actgcctgcacccaggggagaccccrgcccc:cc:gaggag:gccgaga: 22 3 d 

MUM | | | | | I | | ! ! | ! M I! I I M M I I I ! ! ! ! ! ! I ! ! ! I I I I ! I ] I I I i ! , 

St : -t : "3448 cgagatgtgtac tgectgcacccaggggagacccctgcccc tcctgaggagtgccgagat > 3389 
Ouerv : 22 3 6 gaaaagcccca tgc t t tacaagcargcaatcagtcrgac tgccc tcctggc tggcacat t 22 9 5 

I I I ! I I I I I I I i I I I ! II I I I i I I I M I I I I I I I I I I I I i I I I M I I M I I M I I M I I I 

St : ::: : "3 388 gaaaagcccca tgc t ttacaagcatgcaa tcagtttgac tgccc tec tggc tggcacat t 7 jo2 9 
Query: 2296 gaagaatggcagcag 2310 

|||||||||||||||

Sbjct: 73328 gaagaatggcagcag 73314 



Score = 371 bits (187), Expect = 3e-99 
Identities = 187/187 (100%) 
Strand = Plus / Minus 

Query: 3 97 0 aaaggtgt ccct cagee taata taacttggt t gaagagaggaggat c t c tgagtggcaa t 4029 

i i I | i I I i I I I I ! i i I I I M M i I I II I I ! I I : I I I M I I M I I I i i I ! Mill! I I I I I 

St jet : 15 64 aaaggtgtccctcagectaatataacttggttgaagagaggaggatetctgagtggcaat loOS 



uuPrv: 4 03 0 gtttccttgcttttcaatggatccctgttgt tgcagaatgtt tccc t tgaaaatgaagga 4 08 9 

I I I I I I ! I I ! I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I 

Sbjct : 1504 gtttccttgcttttcaatggatccctgttgttgcagaatgtttcccttgaaaatgaagga 1445 



10 90 acctacgtctgcatagccaccaatgctcttggaaaggcagtggcaacatctgtactccac 414 9 

I I I ! ! I I ! I I I I I I I ! I I I I I I I I I I I ! I I I ! I I I I I i I i I I I I I I I I I I 1 I I I I I I I M 

.4 44 acctacgtctgcatagccaccaatgctcttggaaaggcagtggcaacatctgtactccac i3 8 s 



Query: 4150 ttgctgg 4156 

I I I I I I I 
Sbjct: 1384 ttgctgg 1378 



Score = 361 bits (182), Expect = 3e-96 
Identities = 182/182 (100%) 
Strand = Plus / Minus 



2 3 09 agtgt tccaggact tgtggcgggggaactcagaacagaagagtcacc tgtcggcagc tgc 2 3 08 

! i I! i i I i ! I i i i ! i M M ! i i I I ! i i i ! i ; I ! : i i ! I ; i M M I i i m : ! \ ! I ; i i i i ; 

"3 2 L)l agtgt tec aggacttgtggc gggggaac tcagaa cagaagagtcacc tgtegge age tgc 7 j 14: 



2 3 69 taacggatggcagcttt ttgaatctctcagatgaattgtgccaaggacccaaggcatcgt 241 8 

| | j | | j | | j | | | | | | | | ! | j | | | M ! | ! | M ! M | | | J | | ! | j i j | | j j M ! M | M ! M 
^3141 taacggatggcagcttt ctgaatctctcagatgaattgtgccaaggacccaaggcatcgt 7308; 



2 42 9 ctcacaagtcctgtgccaggacagactgtcctccacatttagctgtgggagactggtcga 24c 8 
7 3 081 ctcacaagtcctgtgccaggacagactgtcctccacatttagctgtgggagac tggtega 7 3 02; 



Query: 2489 ag 2490 
||

Sbjct: 73021 ag 73020 



Score = 305 bits (154), Expect = 1e-79 
Identities = 157/158 (99%) 
Strand = Plus / Minus 

nuer/ : 8 03 ttattgaatcaaaaacacttcaaggaagcaaaggagaacacagctt taacagccccggcg 8 62 

II II II II M II II II II 0 II I II II I II I II II II I i II 0 II o II o o II o I i 

Sbj ct : 14 5296 t tat tgaatcaaaaacacttcaaggaagcaaaggagaacacagctt taacagccccggcg l<t5237 
/■uer; : k63 tctttgtcgtagaaaacacaacagtggaatttcagaggggctccgagaggcaaactttta 9 2 2. 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii ii 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i 

Sbjct : 14 5236 tctt tctcgtagaaaacacaacagtggaat ttcagaggggctccgagaggcaaac t t t ta 14:>r / / 
Query: 923 agattccaggacctctgatggctgatttcatcttcaag 960 

||||||||||||||||||||||||||||||||||||||

Sbjct: 145176 agattccaggacctctgatggctgatttcatcttcaag 145139 



Score = 301 bits (152), Expect = 2e-78 
Identities = 155/156 (99%) 
Strand = Plus / Minus 

nuery: 2489 agtgt tctgtcagt tgtggtgt tggaatccagagaagaaagcaggtgtgtcaaaggc tgg 2548 

I I I I I I I I ! I I I I I I I I I I I I I i I I I I I I I I I I I i I I M I I I I I i I I I I I I I I! M I I M 

Sbjct: 45620 agtgttctgtcagttgtggtgttggaatccagagaagaaagcaggtgtgtcaaaggctgg 45b61 



;ery : 2 54 9 cagccaaaggtcggcgcatccccc:cag:gagaugacg:gcaggga:c:accaggc::cc 2 6uS 
rc: : 4 r 56C cagccaaagg:cggcgca:ccccc:cagt:gagacga:g:gcaggga:c:accagggc: 



. (_ ! „ m j j j ^ 



?ry: 26 09 c:c::gcaaga^c::g-:caga:gcc:gag:gcag:a 2 644 

i i i ! ! I i i ! i M i : i ! i i ; i I I i i i i ! 



4:500 ctct tgtaagatcttgccagacgcctgagtgcagta 45465 



Score = 297 bits (150), Expect = 4e-77 
Identities = 150/150 (100%) 
Strand = Plus / Minus 

~>u^v • 1466 agtgcacagtaact tgtggccgagggt tac ggtaccgcgi tgt tctgtgtat taacca :c ^:2d 

! M i I I I I I I I i I ! i i I I ! i M I ! ! : I ! i ; ! : ! I M M ! ! I M I I: : i i i i : i ! i ! I i i ! 

Sbj ct : 118242 agtgcacagtgacttgtggccgagggttacggtaccgggttgttctgtgtattaaccacc 1 18i8 j 
Mvp^y. -i -,26 gcggagagcatgttgggggctgcaatccacaactgaagt tacacatcaaagaagaatg tg 1555 

I I I I I I I I I ! I M I I I I I I I I I II I I I I I I i ! I ! I I ! I I I I i I I I I I I I I I I I I I I MM 

Sbj ct : 118182 gcggagagcatg t tgggggc tgcaatccacaactgaag t tacaca t caaagaagaatg tg ^8^j 

Query: 1586 tcattcccatcccgtgttataaaccaaaag 1615 

||||||||||||||||||||||||||||||
Sbjct: 118122 tcattcccatcccgtgttataaaccaaaag 118093 



Score = 280 bits (141), Expect = 8e-72 
Identities = 141/141 (100%) 
Strand = Plus / Minus 

<~iu^ry : 107 1 aggt tatcagctcaattctgctgaatgtgtggatatccgc t tgaagagggtagt tcctga 112 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I ! I I I I I I I i I I I I I I I I I I I I 

Sbj ct : 12 5991 agg t ta tcagctcaattctgctgaatgtgtgga tat ccgct tgaagagggtagt tcctga ±2 t9 32 
riuery ■ 1 " 3 1 ccat tat tgt cac tact accctgaaaatgtaaaaccaaaaccaaaactgaaggaatgcag 1190 

I | I I I I i ! I I I I I I I I I I I I I I I I I I I ! I I I i I i I I I I i I I I I I I I ! I I I I I ! ! I I I i W 

Sbj ct. : 12 5 931 ccat tat tgt cac tac taccctgaaaa tgtaaaaccaaaaccaaaac tgaaggaatgcag 12 58 i2 
Query: 1191 catggatccctgcccatcaag 1211 

I I I I ! I I ! I I I i I I I i i I i i 

Sbjct: 125871 catggatccctgcccatcaag 125851 



Score = 266 bits (134) , Expect = le-5' 
Identities = 134/134 (100%) 
Strand = Plus / Minus 



guery : 193 6 aggeca t caagaageca tagcagtgtgct ta tatatccagac ::agcagacag:caa:c.a j. d 

. i , , !■<!... i i . . . i ■ 

Sbj ct : 922 0 0 aggeca t caagaageca tagcagtgtgc t taca tat ccagacc :agcagacag:caa:ga 9214 



104 6 cage: tg tgtgatatggtccaccgtcctccag ccatgagccaggcctgtaacaCagagcc 2 1 u d 
92 14 0 cagcttgtgtgatatggtccaccgtcctccagccatgagccaggcctgtaacacagagcc 92 03: 



Query: 2106 ctgtccccccaggt 2119 

||||||||||||||

Sbjct: 92080 ctgtccccccaggt 92067 



Score = 264 bits (133), Expect = 5e-67 
Identities = 133/133 (100%) 
Strand = Plus / Minus 



nuery : 38 4 3 agaggcacc tgtca tc t tgtctgttgaaagaaatatcaccaaaccagagcacaacca tc t 3 9 02 

I I I I I I I I I I I i 1 I I I I I I i II ! ! I i I I ! I I I I I I M I I I I I ! I I I I I i I I I M M : I I 

Sbjct: 25018 agaggcacctgtcatcttgtctgttgaaagaaatatcaccaaaccagagcacaaccatc: 249^9 



ouerv : 3 9 03 gtc tg t tg tggt tggaggcatcgtggaggcagccc t tggagcaaacgtgacaa t ccga tg j9o2 

I I I I I I I i I I i I II I I I I I I I I I I I I I I I I I I I M I I I I M I I ! II I I I I I I I I i I I ; M 

Sbc ct : 24 958 gtctgt tgtggt tggaggcatcgtggaggcagccc t tggagcaaacgtgacaa tccgatg 24S99 
Query: 3963 tcctgtaaaaggt 3975 

|||||||||||||

Sbjct: 24898 tcctgtaaaaggt 24886 



Score = 254 bits (128), Expect = 5e-64 
Identities = 128/128 (100%) 
Strand = Plus / Minus 



Ouerv: 3717 aataat t t tggatggaactgggaagatacaga tacagaatcc tacaaggaaagaacaagg 3V76 

I I I I I I I I I I I I I I I I I I I I i I I I I I I I I M I I M I I I i I II I I I ! I i I I I I I ! i I I ! I i 

Sbjct : 27411 aataattttggatggaactgggaagatacagatacagaatcctacaaggaaagaacaagg 2 > 35: 



Ouery: 3777 catatatgaatgt tctgtagctaatcatct tggt tcagcttgtggactagt tcttctgtgct j o 3 u 

i I i i I I I I I M I I I I I I I I I I I I I I I I I I I I I i i i I i I I I I M I I I I I I I i ! ; I i I I ! M 

Sbjct: 27351 ca tatatgaa tgt tctgtagct aa tea tct tggt tcagatgtggaaagt tc t t c tgtgc t 2/29^ 



;uery: j8j, gtatgcag 
Yd^zz: 2 "2 91 c:a:ccac 2 "2 84 



Score = 230 bits (116), Expect = 7e-57 
Identities = 116/116 (100%) 
Strand = Plus / Minus 



■>erv: 959 agaccaggtacactgcagccaaagacagcgtggttcagt tct tcttt taccagcccct tea 1018 

I I I I I I II i I M i I i I II I I I I II I ! ! I I I M ! I i I I I M I II I I I I I I I M M I h i ; i 

Sbjct : 130°99 agaccaggtacactgcagccaaagacagcgtggttcagttcttcttttaccagcccatca 1j 0940 
Muer\ r : 1019 gtcatcagtggagacaaactgacttctttccctgcactgtgacgtgtggaggaggt 10 74 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! i ; s 1 1 1 1 1 : s 1 1 ; : 1 1 : i 1 \ \ : 1 1 i 1 1 1 \ \ 1 ! 1 1 1 1 1 ' 1 

3b jet : 13 093 9 gtcatcagtggagacaaactgact tctt tccctgcac tgtgacgtgtggaggaggt 13 0884 



Score = 176 bits (89), Expect = 9e-41 
Identities = 89/89 (100%) 
Strand = Plus / Minus 

■~»u^rv: 1614 agaaaaaagtccagtggaagcaaaattgccttggctgaaacaagcacaagaactagaaga 1673 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! i 
Sbjct: 116453 agaaaaaagtccagtggaagcaaaattgcct tggc tgaaacaagcacaagaac tagaaga 1163 94 

Query: 1674 gaccagaatagcaacagaagaaccaacgt 1702 

! ! I ! I I ! ! I I I I ! I I I ! I i ! ! ! ! ! ! M ! 

Sbjct: 116393 gaccagaatagcaacagaagaaccaacgt 116365 



Score = 149 bits (75), Expect = 2e-32 
Identities = 75/75 (100%) 
Strand = Plus / Minus 



/28 gagaagaaaatgtaat tgc tgt tcctttgggaagtcgaagtgtgagaat tacagtgaaag 78 / 

I i i 1 1 i i i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M i m 1 1 1 ii 1 1 1 1 1 1 1 i 1 1 1 1 1 i I i 1 1 1 1 ; 

157331 gagaagaaaatgtaattgctgttcctttgggaagtcgaagtgtgagaattacagtgaaag It / 2 <2 



Query: 788 gacctgcccacctct 802 

|||||||||||||||

Sbjct: 157271 gacctgcccacctct 157257 



Score = 105 bits (53), Expect = 3e-19 
Identities = 53/53 (100%) 
Strand = Plus / Minus 



Query: 1210 agtgatggat t taaagagataa tgccct at caccac t tccaacc tc t t cctcg ^zoz 
Sbjct: 12477 1 agtgatggatt taaagagataa tgccct atgaccact tccaacc tc t tccccg 



>AC116157.3.1.75612 

Length = 75612 

Score = 547 bits (276), Expect = e-152 
Identities = 285/288 (98%) 
Strand = Plus / Minus 



nuerv : 17 01 gttcattccagaaccctggtcagcctgcagtaccacgtgtgggccaggtgtgcaggtccg 17 60 

I I I I ! I I I I I I I I I I I I I I I I I I I I ! I I 1 I I I I i I I I I I I I I ! I I I I I I ! I I I I i I I ! 

Sbjct : 42 6 8 gttcattccagaaccctggtcagcctgcagtaccacgtgtgggccgggtgtgcaggtccg 42 0 9 



f, u ^ v . i -' ( 5 1 rgaggtgaagtgccgtatgc tec tcacattcacgcagactqaqact gage tgcccgagga 182 0 

III! M I I |j| I II I I II I I ! I I I I ! I I : ; I I I I I ; I I ! I ! ! I ! I ; I ! I ! ■ ! I I ! i ' 

Sbjct : 4 2 08 tgaggtgaagtgccgtgtgc tcctcacat tcacgcagactgagac tgagc tgcccgagga 4 j..,9 
uu^ry : 1821 agagtgtgaaggccccaagctgcccaccgaacggccctgcctcctggaagcatgtgatga 18 3 0 

I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I l ! I ! I I I I ! I ! I I I ! I I I I I I I I I I I I I 

Sbjct : 4148 agagtgtgaaggccccaagctgcccaccgaacggecctgcctcctggaagcatgtgatga 4039 
<~iuerv • 1831 gagcccggcctcccgagagctagacatccctctccctgaggacagtgagacgacttacga 194 0 

I I I ! I I I I I ! I I I ! ! I I I ! ! I I I ! I ! I I I I I I ! ! I I I I I I I ! I ! ! ! ! I ! ! I I I I ! > < ! I 

Sbjct. : 4088 gagcccggcctcccgagagctagacatccctctccctgaggacagtgagacgacttacga 4029 
nuerv : 1941 ctgggagtacgctgggttcaccccttgcacagcaacatgcttgggagg 19 88 

I I I I I I ; I I ! I I I I I I I I I I I ! I I I I I I I I I ! I I I I I ! I I i I I i I I I 

Sbjct : 4028 c tgggagtaege tgggt tcacccc t tgcacagcaacatgcgtgggagg 3981 



Score = 408 bits (206), Expect = e-110 
Identities = 206/206 (100%) 
Strand = Plus / Minus 

:\erv : 12 62 gctgggaacataa t cc t tggac t gca tgt tccgtgtcc t g tggaggaggga t tcagagac 13 2 

I M M I I I II I I i I I I I I M i M i i I i I i I I I M i ! I I I i I M I I I I M i I I I i i i i I I i 

Sbjct : 246S9 gctgggaacataatccttggactgcatgttccgtgtcctgtggaggagggdt tcagagac 24 i j 

Ouerv : 13 2 2 aaaac t t t atat a t a t aaaggaa tccat gca tggagaga tat tgcaggt ggaagaatgga 138 

Mill i ! ! T 1 7 1 T ] T ! Ml I j I I I I I I I I i : i i ' I ! ! I I I I I i I I I I I I M I ; M I 1 : ! 
Sbjct : 24629 ggagc t t t gtgtg tgtagaggaa t ecatgea tggagagata t tgcagg tggaagaa tgga 245 



Query: 1332 agtgcatgtacgcacccaaarccaaggttatgcaaactrgtaatctgcccgartgcccca 
Sb^tt: 24565 ag:gca:g:acgcacccaaa-:ccaagg::a:ccaaact:g:aa:c:g:i:ga::gcccca 



?ry: 1442 agtggat tgccatggagcggtctcag ^6, 
: rt : 24509 agtggat tgccatggagtggtctcag 24484 



Score = 305 bits (154), Expect = 1e-79 
Identities = 157/158 (99%) 
Strand = Plus / Minus 



Query : 8 03 ttat tgaatcaaaaacact tcaaggaagcaaaggagaacacagct t taacagccccggcg 8b.: 

Sbj - t : 46577 ttat tgaatcaaaaacact tcaaggaagcaaaggagaacacagct ttaacagccccggcg 4 6518 

rrery : 8 63 tct t t at cgtagaaaacacaacagtggaat t tcagaggggctccgagaggcaaac t t t :a 92 2 

i i I i M I I I I I I I I I I I I i I i I I I I I I i I i I I I i I I I I i I I I I I M I M I I i I i I I I I ; 

Sbi ct. : 4 6517 tct ttctcgtagaaaacacaacagtggaatt tcagaggggc tccgagaggcaaac 1 1 1 ta 4 6458 



">uery : 92 3 agat tccaggacctctgatggctgat ttcatct tcaag 96 0 

I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I 
Sbjct: 46457 agattccaggacctctgatggctgatttcatcttcaag 46420 



Score = 297 bits (150), Expect = 4e-77 
Identities = 150/150 (100%) 
Strand = Plus / Minus 

uuery : 14 6 6 agtgcacagtgae t tgtggccgagggt tacgg taccgggt tg ttc tg tgtat taaccaec 152 5 

I I I I I I I I I I I ! ! I I I I ! ! I ! I i I i i i ! I I : i I I I I i I ! I I i ! I I I i I I I I i i I I I i i I i 

Sbjcr. : 19516 agtgcacagtgact tgtggccgagggttacggtaccgggttgttctgtgtat taaccaec 1945 / 
uuerv: 152 6 geggagagcatgt tgggggc tgcaatccacaac tgaagt tacacatcaaagaagaatgcg 1585 

1 1 1 1 1 i i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Sbjct : 19456 gcggagagcatgttgggggctgcaatccacaactgaagttacacatcaaagaagaatgtg 19397 
Query: 1586 tcattcccatcccgtgttataaaccaaaag 1615 

I I i ! I I I I I I I i I I I ! I M I II I I M I M i 

Sbjct: 19396 tcattcccatcccgtgttataaaccaaaag 19367 



Score = 280 bits (141), Expect = 8e-72 
Identities = 141/141 (100%) 
Strand = Plus / Minus 



)71 aggt tat cage tcaat tc tgctgaatgtgtggatatccgcttgaagaggg tag ttcctga 1130 

i ! : j \ \ \ I ; ! ! i i ! ! : : : i ■ ! : ! ! ! I : : , i : . \ \ \ \ I \ i ! ! ! : M ! M : \ I ! ; i i ; ! ; 

'265 aggttatcagctcaat tctgctgaatgtgtggatatccgct tgaagagggtagt tcctga 2 . 206 



uuery: 1111 ccattat tgtcac tactaccc tgaaaatataaaaccaaaaccaaaac tgaaggaatgcag 1190 

! M ! I i I I I ! ! I ! I I 1 I ! ! I I I ! M ! ! ! i i i ; M M I ! ! I ! I i M i ! MINI ! HIM: 

Sbyrt: 2^205 ccattattgtcactactaccctgaaaatgtaaaaccaaaaccaaaactgaaggaatgcag 2,^o 



1191 catggatccctgcccatcaag 1211 

M M M I M M M M M M I 

27 145 catggatccctgcccatcaag 27125 



Score = 230 bits (116), Expect = 7e-57 
Identities = 116/116 (100%) 
Strand = Plus / Minus 



M'uery: Q 5' : ^ agaccagg tacactgcagccaaagacagcgt gg 1 1 cag t tcttct 1 1 taccagcccatca 1018 

I I I I ! I I I I I I I I M I I M I I M I i I i I I i i i I M I I I I i I I I I I I I i i I M I M I I I I I 

Sbjct : 3 2 273 agaccaggtacactgcagccaaagacagcgtggttcagt tcttc t t t taccagcccatca 3 2214 
uuerv : 10 19 gtcatcagtggagacaaactgacttctttccctgcactgtgacgtgtggaggaggt 107 4 

I M I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I II I I i I I M I I i M I i i i II i I 

Sbjct : 32213 gtcatcagtggagacaaactgacttctttccctgcactgtgacgtgtggaggaggt 32158 



Score = 176 bits (89), Expect = 9e-41 
Identities = 89/89 (100%) 
Strand = Plus / Minus 

■;>uery : 1614 agaaaaaagtccagtggaagcaaaa t tgee t tggctgaaacaagcacaagaac tagaaga 167 3 

M i i I i i i i I M i i I I i I i i i i i i i i I I i i i i i i I M M i I i i M M I I i i i I M i I ! ! I 

Sbjct : 17 727 agaaaaaag tccag tggaagcaaaat tgee t tggctgaaacaagcacaagaac tagaaga 17668 
Query: 1674 gaccagaatagcaacagaagaaccaacgt 1702 

|||||||||||||||||||||||||||||

Sbjct: 17667 gaccagaatagcaacagaagaaccaacgt 17639 



Score = 149 bits (75), Expect = 2e-32 
Identities = 75/75 (100%) 
Strand = Plus / Minus 



ry : 728 gagaagaaaatgtaat tgc tgttcct t tgggaagtcgaagtgtgagaat tacagtgaaag ;87 
:t : 58620 gagaagaaaatgtaat tgctgttcc t ttgggaagt rgaagtgtgagaat tacagtgaaag d8d6 



Query: 788 gacctgcccacctct 802 

|||||||||||||||
Sbjct: 58560 gacctgcccacctct 58546 



Score = 105 bits (53), Expect = 3e-19 
Identities = 53/53 (100%) 
Strand = Plus / Minus 

vjery: 1210 agtgatggatttaaagagataatgccctatgaccact tccaacctc t tcctcg 1262 

I! I I M I I I I t I M I I ! I I I I ! ! I I I I I I I I I I ! I M i I I M I M ! I I ! ! I i ! 

;.cjct: 26045 agtgatggatt taaagagataatgccctatgaccacttccaacctc t tcctcg 25993 



>AC116157.3.94489.171225 
Length = 76737 

Score = 472 bits (238), Expect = e-129 
Identities = 238/238 (100%) 
Strand = Plus / Minus 

'"■uery : 3 6 4 gac tgccc tccagatgcagaagat t tcagagcccagcagtgc tcagcc tacaa tgatgtc 423 

I I I I I I I I I I I I I I I ! ! I I I ! I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I ! I I I I I 

Sbjct : 3187 gac tgccc tccagatgcagaagatt tcagagcccagcagtgc tcagcc tacaa tgatgtc 3 1 2 £ 
'"■upry : 414 cagta tcaggggca t tac ta tgaatggc 1 1 ccacgatataatga tec tgc tgccccgtgt 483 

II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I t I I I I I I I t I 

Sh jet : 3127 cagta tcaggggca t tac ta tgaatggc tt ccacgatataatga tec tgc tgccccgtgt 3 ObE 
-uery : 4 84 gcac tcaagtg teat gcacaaggacaaaac t tggtggtggagc tggcacc taagg tac tg 54 3 

I I I I I I I I I I I I I I I I I I I I I I ! I ! I i I I i I I i I I I I II I i I i I E I M i i i I II ! M ! I I 

Sbict : 3067 gcac t caagtgtca tgcacaaggacaaaac t tggtggtggagc tggcacc taaggtactg 3 0U£ 
■>uery : 5 44 ga tggaactcgt tgcaacacggactcct tggacatgt gtatcagtggca tc tg tcagg 601 

I ! ! I I I ! I II I I i I I I I I I i I i I I i I I I M M i I I I ' i M II I I I I I I I M I i I I I I I 

Sbjct : 3 0 07 ga tggaactcgt tgcaacacggactccttggacatgtgtatcagtggcatctg tcagg 2 95 0 



Score = 260 bits (131), Expect = 8e-66 
Identities = 131/131 (100%) 
Strand = Plus / Minus 



agaccLcaagaaacactcgt:cagar.gaagacaaaga:GGcaac:ggga:gc::ccgccg ^ 

, . i i i i ! i i i M i i I i i i ■ 1 i i i ■ : ^ 

agacc:caagaaacact:cg:tcaga:gaagacaaaga:ggcaac:gggatgc::ggggcg 4 94 6 j> 



24 8 actggagtgactgctcccggacctgtgggggaggagcatcatat :c:c:gcggagatg:: j u . 
4 94 62 actggagtgactgctcccggacctgtgggggaggagcatcatattctctgcggagatgtt 4 94 0 j 



Query: 308 tgactggaagg 318 

|||||||||||
Sbjct: 49402 tgactggaagg 49392 



Score = 97.6 bits (49), Expect = 7e- 
Identities = 49/49 (100%) 
Strand = Plus / Minus 



i~>uery * 316 aggaa t tgt gaagggcagaacatt egg tacaagacatgcagcaa teat g 3 64 

I I I I M I I I I I I I I I I i I I I I I i I I I I I I I : I i I I I I I I I I I I M I I I 

Sbjct : 183^2 aggaattgtgaagggcagaacattcggtacaagacatgcagcaatcatg 18324 



>AC116157.3.75713.94387 
Length = 18675 

Score = 260 bits (131), Expect = 8e-66 
Identities = 131/131 (100%) 
Strand = Plus / Minus 

.->-^>-y. 5 C ^8 caaacagtgggctgcgatcggcaactgqgaagcaatgccaaggaggacaac tgtggagtc 65/ 

I I III I I ! I II I I I I I M I ! I I I i i i ! I ! i ! ! ; ! ! ! ! : ! M ! M ; ! i ! I I i M I ! : I i M 

Sbjct: 3634 caggcagtgggctgcgatcggcaactgggaagcaacgccaaggaggacaaccgtggagtc jd/d 
o-erv: 6S8 tgtgccggcgatggctccacctgcaggcttgtacggggacaatcaaagtcacacgtttct 717 

M I I I I I I I I I I I I I ! I I I I I I I I M I M I I I I I M I I I I I I I I I M I I I I I M M I I I i 

Sbjct : 3574 tg tgccggcgatggc tccacc tgeagget tgtacggggacaatcaaagtcacacgtt tc t j^5 
Query: 718 cctgaaaaaag 728 

i M M M M i I 

Sbjct: 3514 cctgaaaaaag 3504 



>AC087738.10.149851.181632 
Length = 31782 



Score = 244 bits (123), Expect = 5e-61 
Identities = 123/123 (100%) 
Strand = Plus / Plus 



ccttccc gagtttgeactttctcct 12 6 
ccttcccgagt t tgcactt tctcct 922 



cagagaagtt t tc tggaagacacaacaggggagcagt tcctcac t tat cgctat ga tgac 186 

I I I E M I ! I I I I I I I I I I I I M I I I i I I i I I : I M I i i I I i I I M I I : i I I ! i ! I M ; I : 

cagggaagtt etc tggaagacacaacaggggagcagt tcctcac t tat cgctat ga tgac 923 



6 . cagaccacagctgagaaatctcctggagc ttattt 
916 8 cagaccacagc tgagaaa tc tec tggagee ta 1 1 1 



9228 



Query: 187 cag 189 
Sbjct: 9288 cag 9290 



>AC087738.10.1.149750 

Length = 149750 

Score = 137 bits (69), Expect = 8e-29 
Identities = 69/69 (100%) 
Strand = Plus / Plus 



riuery : 1 atggc t tec tggacgagcccc tggtgggtgctgatagggatggtc t tcatgcactc tccc 6 0 

I I i I I I I I I ! I I I ! I I I ! ! ! I i ! ! ! ! ! I ! ! I i ! I I ! ! I ! I II ! I I ! I I I I ! ! ! ! ! ! I ! ! ! 

Sbj ct : 110273 a tggct tec tggacgagcccc tggtgggtgctgatagggatggtc t tcatgcactc tccc 110332 



Query: 61 ctcccgcag 69 

I I I I I I I I ! 

Sbjct: 110333 ctcccgcag 110341 



